Data Science Articles

Sorted By: Most Recent
Peter Grant Peter Grant
Updated on March 06, 2023

What Is a Monte Carlo Simulation?

Monte Carlo simulations are a tool we use to predict the probability of various outcomes in a process that’s difficult to assess due to random variables. Here’s how to perform one yourself.

Image: Shutterstock / Built In
Sergen Cansiz Sergen Cansiz
Updated on March 06, 2023

Mahalanobis Distance and Multivariate Outlier Detection in R

Mahalanobis Distance is an effective distance metric that finds the distance between a point and a distribution. It’s very effective on multivariate data.

Image: Shutterstock / Built In
Eric Kleppen Eric Kleppen
Updated on March 03, 2023

What Is Sentiment Analysis?

We use sentiment analysis to gain insights into a target audience’s feelings about a particular topic. Here are the basics of sentiment analysis types and techniques.

Image: Shutterstock / Built In
Akshay Kumar Akshay Kumar
Updated on March 02, 2023

How to Implement Binary Search in Python

Binary search is a method for searching a sorted list of data to find an item. Here’s how to implement it in Python.

Image: Shutterstock / Built In
Adrienne Kline Adrienne Kline
Updated on March 02, 2023

Parametric vs. Non-Parametric Tests and When to Use Them

A parametric test makes assumptions while a non-parametric test does not assume anything.

Image: Shutterstock / Built in
Nicolai Berg Andersen Nicolai Berg Andersen
Updated on March 01, 2023

What Is JSON?

We use JSON to both store and transfer data. Here’s what you need to know to try JSON out for yourself.

Image: Shutterstock / Built In
Paras Varshney Paras Varshney
Updated on February 28, 2023

Q-Q Plots Explained

Q-Q plots allow data scientists and other statisticians to graphically compare two probability distributions to determine normal distribution. Our expert explains how they work here.

Image; Shutterstock / Built In
Dhanushka Madushan Dhanushka Madushan
Updated on February 28, 2023

How Database B-Tree Indexing Works 

B-tree is used for indexing and is a data structure that provides sorted data and allows searches, sequential access, attachments and removals in sorted order. 

Image: Shutterstock / Built In
Madhav L Mishra Madhav L Mishra
Updated on February 24, 2023

Central Limit Theorem (CLT) Definition and Examples

What is the central limit theorem? Here's the statistical concept explained using a six-sided die.

Image: Shutterstock
Sadrach Pierre Sadrach Pierre
Updated on February 24, 2023

What Is a Function in Python?

A function is a named section of code that performs a specific task. Here, our expert introduces you to how they work in Python.

Image: Shutterstock / Built In

Related Topics