All the data you need.

Tag: Tutorial

Tutorial: Text Classification in Python Using spaCy
Learn text classification using linear regression in Python using the spaCy package in this free machine learning tutorial. The post Tutorial: Text Classification in Python Using spaCy appeared first on Dataquest.
Unleash the potential of Recommender Systems
Recommender systems are one of the most popular algorithms in data science today. In this tutorial, we will build a movie recommender system.
✚ How to Make a Mosaic Plot in R
Also known as a Marimekko diagram, the mosaic plot lets you compare multiple qualitative variables at once. They can be useful, sometimes.Tags: R
✚ How to Make Animated (GIF) Heatmaps in R
Using color as the visual encoding, show changes over time in two dimensions.Tags: animation, R
Hackathon Winner Interview: Hanyang University | Kaggle University Club
Welcome to the third and final installment of our University Club winner interviews! This week the spotlight is on a top-scoring university team, TEAM-EDA from Hanyang University in Korea! Today’s university students are tomorrow’s leading data scientists. That's the catalyst for Kaggle University Club — a virtual community and Slack …
Hackathon Winner Interview: Friendship University of Russia | Kaggle University Club
Welcome to the second installment of our University Club winner interviews! Today’s university students are tomorrow’s leading data scientists. That's the catalyst for Kaggle University Club — a virtual community and Slack channel for existing data science clubs who want to compete in Kaggle competitions together. As our end-of-year event …
Classification using Decision Trees
Decision trees happen to be one of simplest and easiest to explain classification models and, as many argue, closely resemble the human decision making. This blog post has been developed to help you revisit and master the fundamentals of decision tree classification models.
Classification using Decision Trees
Decision trees happen to be one of simplest and easiest to explain classification models and, as many argue, closely resemble the human decision making. This blog post has been developed to help you revisit and master the fundamentals of decision tree classification models.
Hackathon Winner Interview: Penn State | Kaggle University Club
We believe today’s university students are tomorrow’s leading data scientists. As such, we decided to launch Kaggle University Club — a virtual community and Slack channel for existing data science clubs who want to compete in Kaggle competitions together. As our end-of-year event, we hosted our first-ever University Hackathon! 18 …
Unfolding Naive Bayes from Scratch: Part 2
If you had to get started with one machine learning algorithm, Naive Bayes would be a good choice, as it is one of the most common machine learning algorithms that can do a fairly good job at most classification tasks.
How to Gather Your Own Data by Conducting a Great Survey
In this post, we’ll learn to create an online survey and how to prevent some common mistakes made in surveys.
✚ How to Make Frequency Trails in R
Also known as ridgeline plots, the method overlaps time series for a 3-D-ish view of the data. While perhaps not the most visually efficient, the allure is undeniable.Tags: R
11 Design Tips for Data Visualization
Making your data both visually understandable and visually attractive can help it get more attention and help you stand out, whether your audience is the entirety of the internet or just your company’s CEO in the weekly meeting.
✚ How to Make an Animated Pyramid Chart with D3.js
Compare distributions side-by-side with a pyramid chart. Observe the change over the years by animating it.Tags: animation, d3js, pyramid
Python Dictionary Tutorial
Use this tutorial to become familiar with dictionaries and their common uses so that you can start incorporating them immediately into your own code.
✚ How to Make a Tiled Bar Chart with D3.js
Show individual data points by splitting bars into smaller cells. Tags: d3.js
Help! I can’t reproduce a machine learning project!
Have you ever sat down with the code and data for an existing machine learning project, trained the same model, checked your results… and found that they were different from the original results? Not being able to reproduce someone else’s results is super frustrating. Not being able to reproduce your …
✚ How to Make Better-Looking, More Readable Charts in R
Defaults are generalized settings to work with many datasets. This is fine for analysis, but data graphics for presentation benefit from context-specific design.Tags: parameters, R