All the data you need.

Tag: Statistics

Babies and the beta-binomial distribution
About half of children are boys and half are girls, but that doesn’t mean that every couple is equally likely to have a boy or a girl each time they conceive a child. And evidence suggests that indeed the probability of conceiving a girl varies per couple. I will simplify …
Changes to Blackjack payouts so that gamblers lose more to casinos
Katherine Sayre, for The Wall Street Journal, on Las Vegas casinos squeezing out…Tags: gambling, Las Vegas, probability
Top Data Science Ph.D. Dissertations (2019-2020)
The American Mathematical Society (AMS) recently published in its Notices monthly journal a long list of all the doctoral degrees conferred from July 1, 2019 to June 30, 2020 for mathematics and statistics. The degrees come from 242 departments in 186 universities in the U.S. I enjoy keeping a pulse …
Demonstrating how large language models work
You might’ve heard about large language models lately. They’re the “brains” behind recent…Tags: AI, language, model, Upshot
Can you have confidence in a confidence interval?
“The only use I know for a confidence interval is to have confidence in it.” — L. J. Savage Can you have confidence in a confidence interval? In practice, yes. In theory, no. If you have a 95% confidence interval for a parameter θ, can you be 95% sure that …
AI and the American smile
Jenka Gurfinkel discusses the appearance of the American smile in AI-generated images and…Tags: AI, bias, images, midjourney
Privacy and tomography
I ran across the image below [1] when I was searching for something else, and it made me think of a few issues in data privacy. The green lights cut across the man’s body like tomographic imaging. No one of these green lines would be identifiable, but maybe the combination …
Privacy implications of hashing data
Cryptographic hash functions are also known as one-way functions because given an input x, one can easily compute its hashed value f(x), but it is impractical to recover x from knowing f(x). However, if we know that x comes from a small universe of possible values, our one-way function can …
Curiously timed stock trades by ultra-wealthy
Continuing an analysis of IRS records, Robert Faturechi and Ellis Simani for ProPublica…Tags: investments, IRS, ProPublica, wealth
NBA will track players with a third dimension
The NBA currently uses player-tracking that estimates player position on the court in…Tags: basketball, ESPN, NBA, tracking
Why popular songs have so many writing credits
For Tedium, Chris Dalla Riva examined why the number of credited songwriters per…Tags: Chris Dalla Riva, music, producers, Tedium
Burr distribution
Irving Burr came up with a set of twelve probability distributions known as Burr I, Burr II, …, Burr XII. The last of these is by far the best known, and so the Burr XII distribution is often referred to simply as the Burr distribution. Cumulative density functions (CDFs) of …
Sources and attribution for AI-generated images
AI-based image generation take bits and pieces from existing people-made images and tries…Tags: AI, attribution, ethics, Stable Diffusion
Generating music from text
Researchers at Google built a model that generates music based on brief text…Tags: Google, machine learning, music
Manual removal of harmful text to train AI models
AI training data comes from the internet, and as we know but maybe…Tags: AI, cleaning, ethics, OpenAI, Sama
Generative AI trade-offs
People have been having fun with generative AI lately. Enter a prompt and…Tags: AI, ethics, generative art, Maggie Appleton
Maxwell-Boltzmann and Gamma
When I shared an image from the previous post on Twitter, someone who goes by the handle Nonetheless made the astute observation that image looked like the Maxwell-Boltzmann distribution. That made me wonder what 1/Γ(x) would be like turned into a probability distribution, and whether it would be approximately like …
Stable Diffusion to generate spectrograms to convert to sounds
Stable Diffusion is an AI model that lets you enter text to generate…Tags: AI, music, spectrogram, Stable Diffusion