Data Science Articles

Sorted By: Most Recent
Erdem İŞBİLEN Erdem İŞBİLEN
Updated on July 27, 2023

What Is Process Mining?

Process mining is an emerging data science technique that involves analyzing event logs to extract information about an organization’s underlying operational processes. Here’s how it works.

Image: Shutterstock / Built In
Abdishakur Hassan Abdishakur Hassan
Updated on July 26, 2023

4 Ways to Automate Exploratory Data Analysis (EDA) in Python

Next time, use one line of code to automate your EDA. Here’s how.

Image: Shutterstock
Sohail Hosseini Sohail Hosseini
Updated on July 25, 2023

How to Do a T-Test in Python

A t-test is a statistical method that’s used to determine whether there is a significant difference between the means of two groups. Here’s how to do it in Python.

Image: Shutterstock / Built In
Giorgos Myrianthous Giorgos Myrianthous
Updated on July 24, 2023

How to Convert a Dictionary Into a Pandas DataFrame

Learn how to convert a Python dictionary into a Pandas DataFrame in a few different ways, depending on how the data is structured and stored in dictionary.

Image: Shutterstock / Built In
Rory Spanton Rory Spanton
Updated on July 21, 2023

How to Solve FizzBuzz in R

The FizzBuzz problem is a task often given in coding interviews. Impress your interviewers and improve your code with these five solutions.

Image: Shutterstock / Built In
Parag Radke Parag Radke
Updated on July 20, 2023

Monte Carlo Tree Search: A Guide

Monte Carlo tree search (MCTS) is a heuristic search algorithm for decision processes. Here’s what you need to know.

Image: Shutterstock / Built In
Ellen Glover Ellen Glover
Updated on July 20, 2023

What Is Synthetic Data?

Synthetic data allows people to digitally generate the exact data they need, on demand.

Image: Shutterstock / Built In
Giorgos Myrianthous Giorgos Myrianthous
Updated on July 20, 2023

How to Place the Legend Outside the Plot in Matplotlib

When plotting figures with Matplotlib, learn how you can place the legend of a figure outside of the plot area.

Image: Shutterstock / Built In
Chris Dowsett Chris Dowsett
Updated on July 20, 2023

What Is a Data Lake?

A data lake is a digital storage area where businesses hold structured and unstructured data including social media data, log files, emails, images and videos.

Image: Shutterstock / Built In
Suraj Gurav Suraj Gurav
Updated on July 17, 2023

An Introduction to the Confusion Matrix in Python

A confusion matrix is used to evaluate the accuracy of your classification model. Here’s how to build one in Python and read it.

Image: Shutterstock / Built In

Related Topics