Models for structuring big-data and data-analytics projects typically start with a definition of the project’s goals and the business value they are expected to create. The…
How to get into Python, the most widely used programming language for data science.
I am a big fan of the Vega-Altair ecosystem for data visualization, because it not only helps me in creating appealing, interactive visualizations, but it also helps me to…
I co-authored a peer-reviewed paper that compares the different modeling approaches for analytical problem solving.
By aggregating our data in an effort to simplify it, we lose the signal and the context we need to make sense of what we’re seeing. Originally published on <a…
We define the Predictive Power Score (PPS), an alternative to the correlation that finds more patterns in your data. Originally published on <a…
An introduction to performance metrics for binary classification. Originally published on <a…