Data Science Articles

Sorted By: Most Recent
Peter Grant Peter Grant
Updated on May 28, 2025

How to Create Report-Ready Plots in Python

As a data scientist, developing great models and extrapolating nuanced insights won’t get you far if you can’t communicate your findings clearly. Here’s how to present your work using bokeh.

Sara A. Metwalli Sara A. Metwalli
Updated on May 28, 2025

5 Ways to Learn Git and Version Control

Git is a distributed version control system that tracks and manages changes to code. Learn Git and how to use it with these five resources.

Image: Shutterstock / Built In
Sergen Cansiz Sergen Cansiz
Updated on May 27, 2025

Mahalanobis Distance and Multivariate Outlier Detection in R

Mahalanobis distance is a distance metric that finds the distance between a point and a distribution. It’s often used for detecting outliers in multivariate data.

Image: Shutterstock / Built In
Anthony Figueroa Anthony Figueroa
Updated on May 27, 2025

Correlation Is Not Causation

Correlation occurs when two variables change at the same time, while causation is when a change in one variable causes the other to change. Here’s why you need to understand the difference.

Image: Shutterstock / Built In
Henri Woodcock Henri Woodcock
Updated on May 27, 2025

Stop Using NumPy’s Global Random Seed

A NumPy random seed is a numerical value in Python that initializes a random number generator, allowing for reproducible results. Here's why to use np.random.default_rng() instead to set random seeds for individual functions/classes in Python.

Image: Shutterstock / Built In
Sara A. Metwalli Sara A. Metwalli
Updated on May 27, 2025

4 Probability Distributions Every Data Scientist Needs to Know

If you’re just getting started on your journey toward becoming a data scientist, these are the 4 most common distributions you’ll encounter.

Dhilip Subramanian Dhilip Subramanian
Updated on May 23, 2025

K-Nearest Neighbor Algorithm: An Introduction

K-nearest neighbor (KNN) is an algorithm that is used to classify a data point based on how its neighbors are classified. Here’s what you need to know.

Image: Shutterstock / Built In
Artem Oppermann Artem Oppermann
Updated on May 23, 2025

How AI Teach Themselves Through Deep Reinforcement Learning

In this four-part series, I’ll show you how the Markov decision process works and the mathematical theory you need to know to understand deep reinforcement learning.

Sadrach Pierre Sadrach Pierre
Updated on May 22, 2025

An Introduction to Portfolio Optimization in Python

Portfolio optimization in Python involves using Python tools and methods to build an investment portfolio that aims to maximize returns and minimize risk. Here's how to use the Python package PyPortfolioOpt for portfolio optimization.

Image: Shutterstock / Built In
Anthony Corbo Anthony Corbo
Updated on May 22, 2025

What Are JavaScript Algorithms and Data Structures?

JavaScript algorithms allow a data operation to function precisely while JavaScript data structures are a method of storing data for efficient access and modification.

Image: Shutterstock / Built In

Related Topics