All the data you need.

Tag: Python

Differences between List, Tuple, Set, and Dictionary in Python
Overview Python's data structures give us a mechanism to organize data in a way that makes it easy to access and modify. Collections are among the data structures in these. […] The post Differences between List, Tuple, Set, and Dictionary in Python appeared first on Datafloq.
Exploring Polycystic Ovarian Syndrome (PCOS) Symptoms with Data
According to the Endocrine Society, Polycystic ovarian syndrome, PCOS, is one of the most common causes of female infertility, affecting as many as 5 million American women of childbearing age. Women affected by PCOS may produce higher than normal amounts of male hormones , which may impact their overall health, …
AM over GM
Suppose you take the arithmetic mean and the geometric mean of the first n integers. The ratio of these two means converges to e/2 as n grows [1]. In symbols, Now suppose we wanted to visualize the convergence by plotting the expression on the left side for a sequence of …
Audience Text Mining
A Step-by-Step Tutorial on Topic Modeling, Summarization, Keyword Extraction and Sentiment Analysis Using Pre-trained classifiers on Yelp User Reviews Dataset
Machine Learning 101: An Introduction to the Core Concepts
What's Machine Learning All About? Find out in this post!
An Intriguing Job Interview Question for AI/ML Professionals
In my last project, I had to come up with some code and algorithm to solve an interesting problem. I realized that it could lead to some off-the-beaten-path job interview question. The problem is a fundamental one. The level ranges from elementary school to one of the most difficult unsolved …
Enhancing Analysis with Model Interpretability: A Real Estate Dashboard
Machine learning’s rising importance and ubiquity carries with it promises of profitability and efficiency. Basic machine learning models have never been simpler to create. Python and R have easy-to-use libraries that handle all of the heavy machinery. The inner workings are inaccessible when abstracted away, leading to the term “black …
Flight Delays in the COVID-19 Era: Insights from Top Carriers
The COVID-19 pandemic brought major disruption to many of our lives. Work and school became remote, supply chain issues arose, and multiple industries saw a dearth of demand. The travel industry was hit particularly hard as international travel became very difficult, and even domestic flights were rare. We sought to …
Flight Delays in the COVID-19 Era: Insights from Top Carriers
The COVID-19 pandemic brought major disruption to many of our lives. Work and school became remote, supply chain issues arose, and multiple industries saw a dearth of demand. The travel industry was hit particularly hard as international travel became very difficult, and even domestic flights were rare. We sought to …
My 6 Best AI and Machine Learning Articles
Since starting my own AI / machine learning research lab over a year ago, I published 24 technical papers and 4 books, in addition to my articles on Data Science Central. Here I list the most popular ones in random order, with a short summary. The number attached to each …
Golden integration
Let φ be the golden ratio. The fractional parts of nφ bounce around in the unit interval in a sort of random way. Technically, the sequence is quasi-random. Quasi-random sequences are like random sequences but better in the sense that they explore a space more efficiently than random sequences. For …
Building an Automated Data Pipeline for Retail Trade Survey Data
Please see my Github repo for details on the code and pipeline set up. 1. Introduction The objective of this project is to build an ETL pipeline that could support the analysis of the Monthly Retail Trade Survey (MRTS) data. The data is stored in an Excel file by year …
Building an Automated Data Pipeline for Retail Trade Survey Data
The US Census Bureau traces the sales number of retail and food services periodically since 1992. This data is used widely among the government and various types of organizations to predict industry trends, market shares, and all sorts of other applications. However, the data is not formatted in a way …
Predict Retention Rate & LTV – Sree
body { text-align: justify}
Unlocking Home Value
Leveraging Kitchen Quality and Neighborhood Insights for Strategic Renovations Watch Our Presentation Github Repository Introduction According to NerdWallet, one of the top ways you can increase the value of your home is with a kitchen renovation. For this project, we cosplayed as a kitchen renovation company in Ames, Iowa during …
Building a dataset of Python versions with regular expressions
Learn how to use pandas, requests, and regular expressions ("regex") to create a dataset of every Python version and its release date!
EDA and machine learning Ames housing price prediction project
Introduction Buying and investing in the real estate market is one of the biggest decisions people make To be certain that we're making a good real estate purchase, we need to know whether a house is priced fairly or even underpriced. For this project, I used the Kaggle dataset to …
What can data say about work-life balance and achievement?
Introduction Why is work-life balance important? A healthy work-life balance is essential for many reasons. A heavy workload and excessive working hours can shift the work-life balance and make it difficult for employees to take care of their health, personal life, and family and keep their marriages, social needs, and …