Data Science Articles

Sorted By: Most Recent
Giorgos Myrianthous Giorgos Myrianthous
Updated on May 22, 2025

16 Bash Commands Data Scientists Must Know

Bash commands are an important part of the data scientist’s toolkit. This guide introduces you to some of the most important ones.

Image: Shutterstock / Built In
Barak Or Barak Or
Updated on May 22, 2025

Inertial Measurement Unit (IMU) Explained

An inertial measurement unit (IMU) is a device that uses various sensors to capture data about an object’s motion, location and orientation. Here’s how IMUs work and what the pros and cons are of using them.

Image: Shutterstock / Built In
Edoardo Romani Edoardo Romani
Updated on May 22, 2025

What Is Database Normalization?

Database normalization refers to organizing data into tables to improve the efficiency of a database and ensure the consistency and accuracy of its data. Here’s why it matters and the normal forms involved in the process.

Image: Shutterstock / Built In
Sara A. Metwalli Sara A. Metwalli
Updated on May 22, 2025

What Is Data Validation?

Data validation refers to verifying the quality and accuracy of data before using it. These are the main types of data validation, the pros and cons of the process and tips for how to perform data validation.

Image: Shutterstock / Built In
Peter Grant Peter Grant
Updated on May 22, 2025

An Introduction to Bias-Variance Tradeoff

The bias-variance tradeoff describes the inverse relationship between bias and variance, where increasing one decreases the other. Here’s how to strike a balance between the two, so a model learns enough details about a data set without picking up noise.

Image: Shutterstock
Parul Pandey Parul Pandey
Updated on May 22, 2025

Sorting Data Frames in Pandas: A Hands-On Guide

Pandas DataFrames can be sorted by column, index, multiple columns and more. This tutorial introduces you to the basics.

Image: Shutterstock / Built In
Peter Grant Peter Grant
Updated on May 22, 2025

How to Use Float in Python (With Sample Code!)

In Python, floats are a common data type that lets users work with decimal numbers, covering a wider range of values than integers. Check out this quick tutorial on how to make and use floats in Python.

Image: Shutterstock / Built In
Zolzaya Luvsandorj Zolzaya Luvsandorj
Updated on May 22, 2025

A Beginner’s Guide to Propensity Score Matching

Propensity score matching is a causal inference technique that attempts to balance treatment groups on confounding factors, so researchers can gauge the treatment’s causal impact on the outcome. Here are the steps to conduct propensity score matching.

Image: Shutterstock / Built In
Andrew Plummer Andrew Plummer
Updated on May 22, 2025

Box-Cox Transformation and Target Variable: Explained

Box-Cox transformation is a statistical technique that transforms your target variable so that it resembles a normal distribution. Here’s how to implement it in Python.

Image: Shutterstock / Built In
Perez Ogayo Perez Ogayo
Updated on May 22, 2025

How to Fix a CUDA Error: Device-Side Assert Triggered in PyTorch

A CUDA Error: Device-Side Assert Triggered can either be caused by an inconsistency between the number of labels and output units in a model or an incorrect input for a loss function. Follow this guide to fix it. 

Image: Shutterstock / Built In

Related Topics