Data Science Articles

Sorted By: Most Recent
Michael Galarnyk Michael Galarnyk
Updated on November 07, 2022

Explaining the Empirical Rule for Normal Distribution

The empirical rule, also known as the 68-95-99.7 rule, represents the spread of data within a normal distribution. Here’s what you need to know.

Image: Shutterstock / Built In
Michael Galarnyk Michael Galarnyk
Updated on November 07, 2022

Understanding Train Test Split

Train test split is a model validation procedure that reveals how your model performs on new data. Here’s how to apply it.

Image: Shutterstock / Built In
Behic Guven Behic Guven
Updated on November 02, 2022

Top Cloud Computing Platforms With Certification Programs to Know

Cloud computing platforms allow businesses to access their data and run their operations over the internet rather than hard drives and are essential for growth. Here are the platforms you need to know.

Image: Shutterstock / Built In
Thiago Carvalho Thiago Carvalho
Updated on November 02, 2022

Grouping Data With R: A Guide

Grouping data is an important step in the data analysis process, allowing you to summarize important information. Here’s how to group it in R.

Image: Shutterstock / Built In
Satyam Kumar Satyam Kumar
Updated on October 28, 2022

C-Means Clustering Explained

C-means clustering is a clustering technique in which each data point is grouped into different clusters and assigned a probability score. Here’s what you need to know.

Image: Shutterstock / Built In
Dhilip Subramanian Dhilip Subramanian
Updated on October 27, 2022

K-Nearest Neighbor Algorithm: An Introduction

K-nearest neighbor (KNN) is an algorithm that is used to classify a data point based on how its neighbors are classified. Here’s what you need to know.

Image: Shutterstock / Built In
Stephen Gossett Stephen Gossett
Updated on October 26, 2022

These 11 Startups Are Working on Data Privacy in Machine Learning

Techniques like federated learning and differential privacy are maturing — and attracting significant VC investment.

Stephen Gossett Stephen Gossett
Updated on October 26, 2022

Bar Chart Races Are Everywhere. Here’s Why Some Data Viz Experts Hate Them.

Bar chart races got banned from the biggest data viz subreddit. How on Earth did they get so controversial?

Anthony Figueroa Anthony Figueroa
Updated on October 25, 2022

Correlation Is Not Causation

Correlation occurs when two variables change at the same time, while causation is when a change in one variable causes the other to change. Here’s why you need to understand the difference.

Image: Shutterstock / Built In
Eric Kleppen Eric Kleppen
Updated on October 24, 2022

How to Find Variance Using Python

If you’re wondering how to find the variance in your data set, look no further. Here’s how to calculate variance in a snap with Pandas.

Related Topics