1. Introduction The aim of the project is to find the algorithm that can best predict housing prices in Ames, a city in the State of Iowa. Ames is a growing city in Iowa that has experienced some recent real estate developments. This project intends to provide insight that can …
For The New York Times, Kashmir Hill describes the implications of facial recognition…Tags: Clearview, facial recognition, New York Times, privacy
An anonymous source supplied BuzzFeed News with usage data from Clearview AI, the…Tags: Clearview, facial recognition, police, privacy
Dan Bouk and Danah Boyd wrote an essay on the data infrastructure and…Tags: census, counting
The Data Journalism Handbook: Towards a Critical Data Practice now has a second…Tags: book, data journalism
For The Pudding, Lars Verspohl provides an introduction to statistical models disguised as…Tags: modeling, Pudding, wine
March 23, 2021, 3:51 p.m.
Reviewing Deborah Stone’s Counting and Tim Harford’s The Data Detective, Hannah Fry discusses…Tags: Hannah Fry, limitations
March 22, 2021, 6:39 p.m.
For ProPublica, Ken Schwencke reports on a poor data system that relies on…Tags: crime, hate, police, ProPublica
March 22, 2021, 7:09 a.m.
Russell Jeung, chair of the Asian American Studies Department at San Francisco State…Tags: Asian, racism
March 18, 2021, 4:02 p.m.
If you need to calculate Φ(x), the CDF of a standard normal random variable, but don’t have Φ on your calculator, you can use the approximation  Φ(x) ≈ 0.5 + 0.5*tanh(0.8 x). If you don’t have a tanh function on your calculator, you can use tanh(0.8x) = (exp(1.6x) – …
March 15, 2021, 4:52 p.m.
Chris Ume, with the help of Tom Cruise impersonator Miles Fisher, created highly…Tags: AI, deepfake, TikTok, Tom Cruise
March 11, 2021, 7:25 a.m.
Speaking of A.I. and fiction, Adam Epstein for Quartz reported on how Wattpad,…Tags: fiction, machine learning, Quartz, Wattpad
Pamela Miskhin, in collaboration with The Pudding, wrote a love story. It’s not…Tags: AI, Pamela Mishkin, Pudding, story
The Royal Statistical Society published ten lessons governments should takeaway from this year,…Tags: coronavirus, learning, Royal Statistical Society
Learn how to produce descriptive statistics for GDP data using Python data science techniques.
There was a lot of uncertainty in the beginning of the pandemic, so…Tags: coronavirus, forecasting, Youyang Gu
I recently ran across a new probability distribution called the “Scaled Beta2” or “SBeta2” for short in . For positive argument x and for positive parameters p, q, and b, its density is This is a heavy-tailed distribution. For large x, the probability density is O(x–q-1), the same as a …
March 3, 2021, 11:23 a.m.
Oftentimes we see “algorithms” referenced in various contexts, but the definition of an…Tags: algorithm, Kristian Lum