Data Science Articles

Henri Woodcock
Updated on May 27, 2025

Stop Using NumPy’s Global Random Seed

A NumPy random seed is a numerical value that initializes a random number generator so that results are reproducible. Here's why you should use np.random.default_rng() instead of the global seed to set random seeds for individual functions and classes in Python.

Sara A. Metwalli
Updated on May 27, 2025

4 Probability Distributions Every Data Scientist Needs to Know

If you’re just getting started on your journey toward becoming a data scientist, these are the 4 most common distributions you’ll encounter.

Dhilip Subramanian
Updated on May 23, 2025

K-Nearest Neighbor Algorithm: An Introduction

K-nearest neighbor (KNN) is an algorithm that classifies a data point based on how its neighbors are classified. Here’s what you need to know.

Artem Oppermann
Updated on May 23, 2025

How AI Teach Themselves Through Deep Reinforcement Learning

In this four-part series, I’ll show you how the Markov decision process works and the mathematical theory you need to know to understand deep reinforcement learning.

Sadrach Pierre
Updated on May 22, 2025

An Introduction to Portfolio Optimization in Python

Portfolio optimization in Python involves using Python tools and methods to build an investment portfolio that aims to maximize returns and minimize risk. Here's how to use the Python package PyPortfolioOpt for portfolio optimization.

Anthony Corbo
Updated on May 22, 2025

What Are JavaScript Algorithms and Data Structures?

JavaScript algorithms allow a data operation to function precisely, while JavaScript data structures are methods of storing data for efficient access and modification.

Dhanushka Madushan
Updated on May 22, 2025

How Database B-Tree Indexing Works

B-tree indexing organizes large volumes of data such that each node contains keys in ascending order, resulting in faster database searches. Here’s why this technique is important, how it’s used and how it works with SQLite.

Catherine Gitau
Updated on May 22, 2025

Fuzzy String Matching in Python: Introduction to FuzzyWuzzy

Fuzzy string matching is the process of finding strings that approximately match each other. Here’s more on how fuzzy string matching works and how to perform the process using the Python library FuzzyWuzzy.

Matthew Urwin
Updated on May 22, 2025

What Is Quantitative Modeling?

In finance, quantitative modeling is the practice of organizing and interpreting data sets with mathematical formulas to identify trends in the broader markets. Here’s why it’s important, its key characteristics and the tools it involves.

Abdishakur Hassan
Updated on May 22, 2025

Create Interactive Dashboards With Panel and Python

A Python dashboard is an interactive application that displays data visually using charts, tables and controls like buttons or sliders. Here's how to create flexible and powerful dashboards in Python with the Panel library.

