All the data you need.

Tag: Statistics

Johns Hopkins providing course on using epidemiology to understand the Covid-19 numbers
The numbers are fuzzy. You take them at face value, and you end… Tags: course, epidemiology, Johns Hopkins
Communicating a crisis
David Spiegelhalter on communicating a crisis: There are some basic principles, which I… Tags: communication, crisis, David Spiegelhalter
R lang: handy table functions
A couple of handy functions for working with tables in RStudio.
Simple clinical trial of four COVID-19 treatments
A story came out in Science yesterday saying the World Health Organization is launching a trial of what it believes are the four most promising treatments for COVID-19 (a.k.a. SARS-CoV-2, novel coronavirus, etc.). The four treatment arms will be: Remdesivir; Chloroquine and hydroxychloroquine; Ritonavir + lopinavir; Ritonavir + lopinavir …
Understanding data and statistics in the medical literature
Jeffrey Leek, Lucy D’Agostino McGowan, and Elizabeth Matsui have a free/pay-what-you-want book… Tags: academic, book
Possible coronavirus deaths compared against other causes
Based on estimates from public health researcher James Lawler, The Upshot shows the… Tags: coronavirus, mortality, uncertainty, Upshot
Statistical Variability (Standard Deviation, Percentiles, Histograms)
In my previous article about statistical averages, we discussed how you can describe your dataset with a few central values (mean, median and mode). That’s well and good… The post Statistical Variability (Standard Deviation, Percentiles, Histograms) appeared first on Data36.
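As a quick illustration of the measures named in that title, here is a minimal sketch using only Python's standard `statistics` module. The sample values are hypothetical and are not taken from the Data36 article.

```python
import statistics
from collections import Counter

# Hypothetical sample, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(values)            # central value
pop_sd = statistics.pstdev(values)        # population standard deviation
sample_sd = statistics.stdev(values)      # sample standard deviation (n - 1)

# Three cut points that split the data into four equal-probability groups.
quartiles = statistics.quantiles(values, n=4)

# A crude text histogram: one '#' per occurrence of each value.
histogram = Counter(values)
for value in sorted(histogram):
    print(f"{value:>2} | {'#' * histogram[value]}")

print("mean:", mean)
print("population sd:", pop_sd)
print("sample sd:", round(sample_sd, 3))
print("quartiles:", quartiles)
```

The population and sample standard deviations differ only in the divisor (n versus n - 1), so the sample version is always slightly larger for the same data.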
All data is wrong
Vicki Boykis riffing off the George Box quote, “All models are wrong, some… Tags: uncertainty, Vicki Boykis
Dataset as worldview
Hannah Davis works with machine learning, which relies on an input dataset to… Tags: bias, Hannah Davis
Automatic data reweighting
Suppose you are designing an autonomous system that will gather data and adapt its behavior to that data. At first you face the so-called cold-start problem. You don’t have any data when you first turn the system on, and yet the system needs to do something before it has accumulated …
Testing Gmail’s tab choices on presidential candidates’ emails
For many, Gmail automatically categorizes incoming emails to the primary inbox, promotions, and… Tags: election, Gmail, Google, Guardian, The Markup
Federal budget scaled to per person dollars
For The Upshot, Alicia Parlapiano and Quoctrung Bui scaled down the federal budget… Tags: Alicia Parlapiano, budget, Quoctrung Bui, scale, Upshot
K-Nearest Neighbors explained
In this post, I explain the intuition and logic behind the KNN algorithm and show a simple implementation written in pure pandas that yields 98% accuracy on the Iris dataset.
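To make the idea concrete, here is a minimal sketch of k-nearest-neighbors classification. It is not the post's pandas implementation: it uses only the Python standard library, the function name `knn_predict` is made up for illustration, and the toy data is hypothetical rather than the Iris dataset.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (features, label) pairs; distance is Euclidean.
    """
    by_distance = sorted(train, key=lambda row: math.dist(row[0], query))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data (not Iris): two well-separated clusters.
train = [
    ((1.0, 1.1), "a"),
    ((1.2, 0.9), "a"),
    ((0.8, 1.0), "a"),
    ((5.0, 5.2), "b"),
    ((5.1, 4.9), "b"),
    ((4.8, 5.0), "b"),
]

print(knn_predict(train, (1.1, 1.0)))
print(knn_predict(train, (4.9, 5.1)))
```

Sorting every training point by distance is fine for small data; for larger datasets a spatial index or a partial sort would be the usual next step.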
Billionaire’s spending scaled to your net worth
We hear about billionaires spending millions of dollars on ads, acquisitions, etc. It… Tags: money, scale, Washington Post, wealth
Data problems in Iowa caucus results
It wasn’t just issues with an app. There appear to be many more… Tags: caucus, error, Iowa, Upshot
Privacy algorithm could lead to Census undercount of small towns
To increase anonymity in the Census records, the bureau is testing an algorithm… Tags: census, counting, New York Times, privacy
Effect of College Selection on ROI
Description: The current project applied web scraping techniques to investigate the effect of degree type, college type, college major, and regionality on early- and mid-career earnings and estimated 20-year return on investment of higher education. Background: College tuition is at an all-time high, and a barrier to entry for many …
What Is a Chi-Square Test and Why Do We Use It?
This post will give you insight into how Chi-Square tests work and when to use them.
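For readers who want to see the arithmetic, here is a minimal sketch of Pearson's chi-square test of independence using only the Python standard library. The counts are hypothetical, the function name `chi_square_statistic` is made up for illustration, and no continuity correction is applied. The p-value shortcut via `math.erfc` is valid only for one degree of freedom, as in a 2x2 table.

```python
import math

def chi_square_statistic(observed):
    """Return (statistic, degrees of freedom) for a contingency table.

    `observed` is a list of rows of counts. Expected counts assume
    independence: row total * column total / grand total.
    """
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (count - expected) ** 2 / expected
    dof = (len(observed) - 1) * (len(observed[0]) - 1)
    return stat, dof

# Hypothetical 2x2 table of counts, for illustration only.
observed = [[30, 10],
            [20, 40]]

stat, dof = chi_square_statistic(observed)

# With exactly 1 degree of freedom, the upper-tail probability of the
# chi-square distribution equals erfc(sqrt(stat / 2)).
p_value = math.erfc(math.sqrt(stat / 2))

print("statistic:", round(stat, 4))
print("degrees of freedom:", dof)
print("p-value:", p_value)
```

A small p-value suggests the row and column variables are not independent; for larger tables the p-value would need the general chi-square distribution rather than this one-degree-of-freedom shortcut.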