All the data you need.

Tag: Pandas

Yunnan Sourcing Tea Storefront and Analysis of the High End Tea Market
Github | LinkedIn | Yunnan Sourcing. Introduction: Where many online tea wholesalers curate particular, international selections of teas, Yunnan Sourcing distinguishes itself by highlighting local sources. Furthermore, what makes it a compelling target for analysis is its focus on "verified purchase reviews." We will begin our analysis by laying the …
How to supercharge data exploration with Pandas Profiling
Producing insights from raw data is a time-consuming process. Predictive modeling efforts rely on dataset profiles, whether consisting of summary statistics or descriptive charts. Pandas Profiling, an open-source tool built on pandas DataFrames, can simplify and accelerate such tasks. This blog explores the challenges associated with doing …
Awesome functions in pandas and seaborn
Just a couple of handy functions to visualise and overview data
Python Autocomplete Improvements for Databricks Notebooks
At Databricks, we strive to provide a world-class development experience for data scientists and engineers, and new features are constantly getting added to our notebooks to improve our users’ productivity. We are especially excited about the latest of these features, a new autocomplete experience for Python notebooks (powered by the …
Retention-Driven Marketing for Music Apps
Github Repository | LinkedIn: Rob Davis, James Welch, Sita Thomas. Background: For this project we were tasked with designing a marketing strategy for KKBox, a streaming music service. We were given four datasets describing user demographics, transaction history, listening history, and churn rate. This project explores which users are the …
Beginner Python Tutorial: Analyze Your Personal Netflix Data
How much time have you spent watching The Office on Netflix? Find out with this entry-level tutorial on analyzing your own Netflix usage data! The post Beginner Python Tutorial: Analyze Your Personal Netflix Data appeared first on Dataquest.
Do You Post Too Much? Analyze Your Personal Facebook Data with Python
As of Q2 2020, Facebook claims more than 2.7 billion active users. That means that if you're reading this article, chances are you're a Facebook user. But just how much of a Facebook user are you? How much do you really post? We can find out using Python! Specifically, we're …
How and why I built a Machine Learning model to predict table tennis match results
I'm a Data Professional who loves building data products to solve problems. I'm currently working with professionals from various backgrounds to provide new analytical insights in industry. I'd love to use my passion for open data to keep contributing to changing people's lives in a better and more analytical world.
Tutorial: Add a Column to a Pandas DataFrame Based on an If-Else Condition
When we’re doing data analysis with Python, we might sometimes want to add a column to a pandas DataFrame based on the values in other columns of the DataFrame. Although this sounds straightforward, it can get a bit complicated if we try to do it using an if-else conditional. Thankfully, there’s …
Pandas tutorial 5: Scatter plot with pandas and matplotlib
Scatter plots are frequently used in data science and machine learning projects. In this pandas tutorial, I’ll show you two simple methods to plot one. Both solutions will... The post Pandas tutorial 5: Scatter plot with pandas and matplotlib appeared first on Data36.
How to Run a Python Script? (Step by Step Tutorial, with Example)
In this tutorial, you’ll learn how to run a Python script. And it’s quite essential. When working on data science projects, you’ll write Python code all the time… The post How to Run a Python Script? (Step by Step Tutorial, with Example) appeared first on Data36.
10 Minutes from pandas to Koalas on Apache Spark
This is a guest community post from Haejoon Lee, a software engineer at Mobigen in South Korea and a Koalas contributor. pandas is a great tool to analyze small datasets on a single machine. When the need for bigger datasets arises, users often choose PySpark. However, the converting code from …
Pandas Cheat Sheet — Python for Data Science
If you're interested in working with data in Python, you're almost certainly going to be using the pandas library. But even when you've learned pandas — perhaps in our interactive pandas course — it's easy to forget the specific syntax for doing something. That's why we've created a pandas cheat …
Cloud Data Science 5
Welcome to Cloud Data Science 5. There were not as many announcements as last week in Cloud Data Science 4, but quantity is not what is important. The first announcement … The post Cloud Data Science 5 appeared first on Data Science 101.
✚ How to Make Line Charts in Python, with Pandas and Matplotlib
The chart type can be used to show patterns over time and relationships between variables. This is a comprehensive introduction to making them using two common libraries. Tags: Matplotlib, pandas, Python
Tutorial: Python Regex (Regular Expressions) for Data Scientists
In this tutorial, learn how to use regular expressions and the pandas library to manage large data sets during data analysis. The post Tutorial: Python Regex (Regular Expressions) for Data Scientists appeared first on Dataquest.
Excel vs Python: How to Do Common Data Analysis Tasks
In this tutorial, we’ll compare Excel and Python by looking at how to perform basic analysis tasks across both platforms. Excel is the most commonly used data analysis software in the world. Why? It’s easy to get the hang of and fairly powerful once you master it. In contrast, Python’s …
Scaling Financial Time Series Analysis Beyond PCs and Pandas: On-Demand Webinar, Slides and FAQ Now Available!
On Oct 9th, 2019, we hosted a live webinar, "Scaling Financial Time Series Analysis Beyond PCs and Pandas," with Junta Nakai, Industry Leader Financial Services at Databricks, and Ricardo Portilla, Solution Architect at Databricks. The webinar showcased the content in this blog: Democratizing Financial Time Series …