All the data you need.

Tag: Python

Building a Safer Future
Dive into the heart of NYC construction safety. Through thorough analysis of accident reports from the Department of Building database we will unravel the who, where, when, and why behind injuries and fatalities in the ultimate goal to inspire construction reform. Together, let's "Build a Safer Future" for workers and …
Emerging Tools and Frameworks in AI: A Comparative Analysis
In this contributed article, graphic designer and content writer, Erika Ballo delves into some emerging tools and frameworks in AI, comparing their strengths, usability, and ideal use cases.
Kepler triangle
A Kepler triangle is a right triangle whose sides are in geometric progression. That is, if the sides have length a < b < c, then b/a = c/b = k. All Kepler triangles are similar because the proportionality constant k can only take on one value. To see this, …
Factored random numbers
A couple days ago Michael Nielsen posted an image of a one-page paper that gives an algorithm for generating factored random numbers, uniformly distributed from 1 to some designated N. The algorithm does not generate random numbers then factor them. It’s more efficient than that, generating the factorization along with …
New Book: Statistical Optimization for GenAI and Machine Learning
In the last two years, I published 5 machine learning and AI books, including one on synthetic data by Elsevier. This represents over 800 pages of compact, state-of-the-art material. The new addition features my most recent advances: the problems that I encountered with generative adversarial networks, and how I overcome …
Be a YouTube mrBeast
Introduction: The goal of this study is to analyze the variables that contributed to YouTubers’ success in 2023, specifically their subscribers and earnings. Before identifying those variables, we have to establish the measure of success. Some questions I would like to have answered: Data: The dataset I used, Global YouTube …
Exploring Chess Openings: Can We Pinpoint a 'Best' Opening?
Chess, one of most popular and well recognized board games in the world, has experienced a recent resurgence over the past few years. This revival has attracted many new players to chess, including myself. Even though the knowledge of strategy and tactics in chess is vast, openings stand out as …
Python code for means
The last couple article have looked at various kinds of mean. The Python code for four of these means is trivial: gm = lambda a, b: (a*b)**0.5 am = lambda a, b: (a + b)/2 hm = lambda a, b: 2*a*b/(a+b) chm = lambda a, b: (a**2 + b**2)/(a + …
Cross-platform way to enter Unicode characters
The previous post describes the hoops I jumped through to enter Unicode characters on a Mac. Here’s a script to run from the command line that will copy Unicode characters to the system clipboard. It runs anywhere the Python module pyperclip runs. #!/usr/bin/env python3 import sys import pyperclip cp = …
PyTorch is Exceedingly Good for AI and Data Science Practice
The PyTorch community has made remarkable strides in recent times. Last year, contributors of PyTorch introduced BetterTransformer inference optimizations for transformer models such as GPT, which have significantly improved the […] The post PyTorch is Exceedingly Good for AI and Data Science Practice appeared first on Datafloq.
Simulate the Monty Hall problem in Python 🐐🚘🐐
Use Python to solve this classic probability puzzle that has stumped mathematicians and Nobel Prize winners!
✚ Why I Use R More than Python
Debates about which is best are useless. Go with what works for you, and never mind the rest.Tags: Python, R, tools
Starburst Introduces Python DataFrame Support for Complex Data Transformation and Data Application Workloads
Starburst, the data lake analytics platform, today extended their support for the most widely used multi-purpose, high-level programming language, Python with PyStarburst, as well as announced a new integration with the open source Python library, Ibis, built in collaboration with composable data systems builder and Ibis maintainer, Voltron Data.
What is a Developer Advocate, TDI 16 with @arynnpost
“How well can you write about a process, how cohesive is it, and can you write for different audiences?” — A M-P (@arynnpost) on Threads First off, @arynnpost, your profile lists you as a developer advocate. What is a developer advocate and how did you get into that role? Developer …
Threads Dev Interview 11: @devben.tech
“if I could offer a piece of advice to my past self regarding development, it would be to embrace a growth mindset. “ —Dev Ben (@devben.tech) on Threads Today we have @devben.tech. Welcome @devben.tech, how did you get started with computer programming? I’m excited to be part of Threads Dev …
✚ How I Made That: Network Diagrams of All the Household Types
Process the data into a usable format, which makes the visualization part more straightforward.Tags: Illustrator, Python, R
UMass Amherst Computer Scientists Use AI to Accelerate Computing Speed by Thousands of Times
A team of computer scientists at the University of Massachusetts Amherst, led by Emery Berger, recently unveiled a prize-winning Python profiler called Scalene. Programs written with Python are notoriously slow—up to 60,000 times slower than code written in other programming languages—and Scalene works to efficiently identify exactly where Python is …
Curvature at Cairo
I was flipping through Gravitation [1] this weekend and was curious about an illustration on page 309. This post reproduces that graph. The graph is centered at Cairo, Egypt and includes triangles whose side lengths are the distances between cities. The triangles are calculated using only distances, not by measuring …