All the data you need.

Tag: Statistics

Three flavors of data scientist
As the field grows and needs develop throughout companies, specialization in data science…Tags: Airbnb, data science
Finding the Beatle who wrote each song using statistical models
There’s been some disagreement about who wrote “In My Life” by The Beatles,…Tags: Beatles, music
Finding the Beatle who wrote each song using statistical models
There’s been some disagreement about who wrote “In My Life” by The Beatles,…Tags: Beatles, music
How to Code the Student’s t-Test from Scratch in Python
Perhaps one of the most widely used statistical hypothesis tests is the Student’s t test. Because you may use this test yourself someday, it is important to have a deep understanding of how the test works. As a developer, this understanding is best achieved by implementing the hypothesis test yourself …
Calculating wind drag in the cycling peloton
When cyclists ride in that big pack during a race — the peloton…Tags: cycling, peloton, Wall Street Journal
Calculating wind drag in the cycling peloton
When cyclists ride in that big pack during a race — the peloton…Tags: cycling, peloton, Wall Street Journal
How to Calculate McNemar’s Test to Compare Two Machine Learning Classifiers
The choice of a statistical hypothesis test is a challenging open problem for interpreting machine learning results. In his widely cited 1998 paper, Thomas Dietterich recommended the McNemar’s test in those cases where it is expensive or impractical to train multiple copies of classifier models. This describes the current situation …
When wife earns more than husband, they report a lesser gap
Marta Murray-Close and Misty L. Heggeness for the Census Bureau compared income responses…Tags: census, income
When wife earns more than husband, they report a lesser gap
Marta Murray-Close and Misty L. Heggeness for the Census Bureau compared income responses…Tags: census, income
Analysis: Do the shoes matter in marathon running?
Kevin Quealy and Josh Katz for The Upshot analyzed shoe and running data…Tags: Nike, running, shoes, Upshot
Analysis: Do the shoes matter in marathon running?
Kevin Quealy and Josh Katz for The Upshot analyzed shoe and running data…Tags: Nike, running, shoes, Upshot
The Role of Randomization to Address Confounding Variables in Machine Learning
A large part of applied machine learning is about running controlled experiments to discover what algorithm or algorithm configuration to use on a predictive modeling problem. A challenge is that there are aspects of the problem and the algorithm called confounding variables that cannot be controlled (held constant) and must …
Millions of internet-connected TVs track viewing habits
Sapna Maheshwari for The New York Times on Samba TV software running on…Tags: privacy, television
Millions of internet-connected TVs track viewing habits
Sapna Maheshwari for The New York Times on Samba TV software running on…Tags: privacy, television
Neural networks to communicate with Alexa devices using sign language
Many have found Amazon’s Alexa devices to be helpful in their homes, but…Tags: Alexa, neural network, sign language, TensorFlow
Neural networks to communicate with Alexa devices using sign language
Many have found Amazon’s Alexa devices to be helpful in their homes, but…Tags: Alexa, neural network, sign language, TensorFlow
All of Statistics for Machine Learning
A foundation in statistics is required to be effective as a machine learning practitioner. The book “All of Statistics” was written specifically to provide a foundation in probability and statistics for computer science undergraduates that may have an interest in data mining and machine learning. As such, it is often …
Changing Twitter, with Statistics
Earlier this year, The New York Times investigated fake followers on Twitter showing…Tags: fake, New York Times, Twitter