Data Science Articles

Rahul Agarwal
Updated on October 07, 2024

A Complete Guide to PySpark DataFrames

Bookmark this cheat sheet. It contains all the information you’ll need on DataFrame functionality.

Rebecca Vickery
Updated on October 07, 2024

How to Create a Pandas Pivot Table in Python

Pandas’ pivot_table function works much like a spreadsheet pivot table, making it easier to group, summarize and analyze your data. Here’s how to create your own.

Suraj Gurav
Updated on October 07, 2024

How to Rename Columns in Pandas

Learn how to rename columns in the Pandas Python library. Master these four techniques to deal with messy column names in a Pandas DataFrame.

Anuja Nagpal
Updated on October 03, 2024

L1 and L2 Regularization Methods, Explained

L1 and L2 regularization are two widely used ways to manage overfitting when you’ve got a large set of features in a machine learning model. L1 can also perform feature selection by shrinking some coefficients to exactly zero.

Indhumathy Chelliah
Updated on October 01, 2024

How to Find the Index of a Substring in Python

A complete guide to using string methods and regexes in Python to find the index of a substring. Learn all five methods.

Anthony Corbo
Updated on September 30, 2024

What Are Data Structures?

A data structure is a format for organizing, processing, retrieving and storing data so it can be easily accessed and effectively used.

Xuenong Hong
Updated on September 30, 2024

Graph Neural Network (GNN): What It Is and How to Use It

A graph neural network (GNN) is a powerful tool for analyzing graph-structured data. This introduction shows you how.

Oscar Contreras Carrasco
Updated on September 30, 2024

Gaussian Mixture Model Explained

A Gaussian mixture model is a soft clustering machine learning method used to determine the probability that each data point belongs to a given cluster. Learn more.

Will Koehrsen
Updated on September 25, 2024

Precision and Recall: How to Evaluate Your Classification Model

When assessing your classification model, make sure to choose the right metrics. Here’s how.

Suraj Gurav
Updated on September 19, 2024

How to Use Pass, Continue and Break in Python

Pass, continue and break statements in Python are used to alter the flow of a loop in different ways. Here’s what you need to know to use them effectively.
