Data Science Articles

Sorted By: Most Recent
Sara A. Metwalli Sara A. Metwalli
Updated on June 02, 2025

10 Steps to Become a Data Scientist

Use this roadmap to kickstart your data science career.

Cassie Kozyrkov Cassie Kozyrkov
Updated on June 02, 2025

Statistics: Are You Bayesian or Frequentist?

Is your statistical alignment Bayesian or a Frequentist? It all comes down to random variables. Learn more.

Image: Shutterstock / Built In
Peter Grant Peter Grant
Updated on May 29, 2025

How to Use Python Functions Effectively: 6 Tips to Know

Here are six things you need to know about using these powerful tools in order to write more Pythonic code.

Sara A. Metwalli Sara A. Metwalli
Updated on May 29, 2025

4 Essential Skills Every Data Scientist Needs

There’s more to data science than data. These 4 skills will help you land (and keep!) that dream job.

Mitchell Telatnik Mitchell Telatnik
Updated on May 28, 2025

Machine Learning for Beginners (With Weka)

Follow along with this machine learning for beginners tutorial, which walks through the basics of classification and regression algorithms and how to build a machine learning model in Weka.

Image: Shutterstock / Built In
Sergen Cansiz Sergen Cansiz
Updated on May 28, 2025

Covariance Matrix: Definition, Derivation and Applications

A covariance matrix is a square matrix that shows the covariance between every pair of variables in a given data set, where each element in the matrix represents the corresponding covariance.

Image: Shutterstock / Built In
Parul Pandey Parul Pandey
Updated on May 28, 2025

What Is the Dummy Variable Trap? (With Pandas Code Examples)

Here are a few important caveats to keep in mind when you’re encoding data with pandas.get_dummies().

Peter Grant Peter Grant
Updated on May 28, 2025

How to Create Report-Ready Plots in Python

As a data scientist, developing great models and extrapolating nuanced insights won’t get you far if you can’t communicate your findings clearly. Here’s how to present your work using bokeh.

Sara A. Metwalli Sara A. Metwalli
Updated on May 28, 2025

5 Ways to Learn Git and Version Control

Git is a distributed version control system that tracks and manages changes to code. Learn Git and how to use it with these five resources.

Image: Shutterstock / Built In
Sergen Cansiz Sergen Cansiz
Updated on May 27, 2025

Mahalanobis Distance and Multivariate Outlier Detection in R

Mahalanobis distance is a distance metric that finds the distance between a point and a distribution. It’s often used for detecting outliers in multivariate data.

Image: Shutterstock / Built In

Related Topics