Learn data science with Python. Explore the Pandas and Matplotlib libraries for analysing and plotting data.
Python Scrapy code and Jupyter notebook for visualizations: https://github.com/datatodavid/GF_Restaurant_Scraper
Why I Scraped FindMeGlutenFree.com: FindMeGlutenFree.com is a go-to website in the gluten-free and celiac community. The platform works as a Yelp-like, user-based restaurant search engine, but one meant only for restaurants with gluten-free offerings. This focus makes it …
Photo by Ivan Marc on Shutterstock. GitHub Repository | LinkedIn. Motivation: As a dedicated Netflix user for the past several years, I often find myself scrolling over and over, trying to find a TV series to watch. At some point, I came to the realization that maybe it is about …
Nov. 12, 2020, 11:26 p.m.
GitHub Repository | LinkedIn: Rob Davis, James Welch, Sita Thomas. Background: For this project, we were tasked with designing a marketing strategy for KKBox, a streaming music service. We were given four datasets describing user demographics, transaction history, listening history, and churn rate. This project explores which users are the …
Nov. 10, 2020, 11:36 a.m.
Background: The horse racing community has been using quantitative data to develop betting algorithms for decades. Indicators including horse bodyweight, age, and previous lap times are all used along with the domain-specific Speed Index to predict future race outcomes. Our team was asked to answer a new question in …
Introduction: Blue Nile is the leading online retailer of certified diamonds, engagement rings, and fine jewelry. It claims to be the first and largest company selling diamonds and engagement rings online. The goal of this research is to find out which engagement ring styles are popular on BlueNile.com and identify …
Introduction: Healthcare insurance fraud is not common, but unfortunately it does exist. According to the National Health Care Anti-Fraud Association, health care fraud costs around $68 billion annually in the US alone. This is only a fraction of the total revenue of the industry, so identifying fraudulent activities in healthcare …
Scatter plots are frequently used in data science and machine learning projects. In this pandas tutorial, I’ll show you two simple methods to plot one. Both solutions will... The post Pandas tutorial 5: Scatter plot with pandas and matplotlib appeared first on Data36.
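The excerpt above does not name the post's two methods, so the following is only a minimal sketch of two common ways to draw a scatter plot: pandas' own `DataFrame.plot.scatter` and matplotlib's `Axes.scatter`. The sample data and the column names `height_cm` and `weight_kg` are made up for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical sample data (not taken from the original post)
df = pd.DataFrame({
    "height_cm": [150, 160, 170, 180, 190],
    "weight_kg": [52, 58, 69, 80, 91],
})

# Approach 1: pandas' plotting wrapper, which returns a matplotlib Axes
ax1 = df.plot.scatter(x="height_cm", y="weight_kg")

# Approach 2: matplotlib directly, passing the two columns as x and y
fig, ax2 = plt.subplots()
ax2.scatter(df["height_cm"], df["weight_kg"])
ax2.set_xlabel("height_cm")
ax2.set_ylabel("weight_kg")
```

Both approaches draw on a matplotlib `Axes`, so titles, limits, and `fig.savefig(...)` work the same way afterwards.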
The chart type can be used to show patterns over time and relationships between variables. This is a comprehensive introduction to making them using two common libraries. Tags: Matplotlib, pandas, Python
How much have I spent on Amazon? That's a scary question, but if you want to know the answer, here's how you can find it...and a lot more! The post How Much Have You Spent on Amazon? Analyzing Amazon Data appeared first on Dataquest.
Sept. 10, 2019, 3:46 p.m.
If you've already mastered the basics of iterating through Python lists, take it to the next level and learn to use for loops in pandas, numpy, and more! The post Tutorial: Advanced For Loops in Python appeared first on Dataquest.
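As a rough illustration of what looping over pandas and NumPy objects can look like (this is a sketch under assumptions, not the tutorial's own code), the example below iterates over DataFrame rows with `itertuples`, shows the equivalent vectorized column operation, and walks a NumPy array with `np.nditer`. The data and column names are hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical data (not taken from the tutorial)
df = pd.DataFrame({
    "item": ["a", "b", "c"],
    "price": [1.5, 2.0, 3.25],
    "qty": [2, 1, 4],
})

# Row-by-row loop: itertuples yields one namedtuple per row
totals = []
for row in df.itertuples(index=False):
    totals.append(row.price * row.qty)

# Vectorized alternative: the same arithmetic on whole columns, no explicit loop
df["total"] = df["price"] * df["qty"]

# NumPy: nditer visits every element of a multi-dimensional array in order
arr = np.array([[1, 2], [3, 4]])
flat = [int(x) for x in np.nditer(arr)]
```

For simple column arithmetic like this, the vectorized form is usually shorter and faster than an explicit loop; loops remain useful when each row needs logic that does not map onto column operations.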
Predict survival on the Titanic
The New NBA Season Has Begun! The NBA, which had a lot of issues during the offseason (the "stove league"), has just opened. Which team will win the championship this season? Who will win the MVP? Each team has invested heavily in order to win, but the results are unknown. Was their investment appropriate? Let's …
Oct. 21, 2018, 10:11 p.m.
In this stats portfolio project, we use pandas, matplotlib, and Seaborn to research the two best markets for advertising an e-learning product.
In this portfolio project, we analyze movie ratings data to determine whether there have been any changes in Fandango's rating system.
Real-time processing of a live image feed from a webcam. Useful for testing image filters, video recording, and real-time video manipulation using Python.
A Simple Trending Products Recommendation Engine in Python
Feb. 28, 2017, 12:02 p.m.