Data Science Articles

Sorted By: Most Recent
Anmol Tomar Anmol Tomar
Updated on March 14, 2025

Elbow Method in K-Means Clustering: Definition, Drawbacks, vs. Silhouette Score

The elbow method is a technique used to find the optimal number of clusters (K) in k-means clustering, by identifying the “elbow” point on a graph of k-values and their corresponding within-cluster sum of squares (WCSS) values.

Image: Shutterstock / Built In
Stephen Gossett Stephen Gossett
Updated on March 13, 2025

Dscout Discusses How to Collect User Data on Gender Identity — and When Not to

Gender data can be valuable for a variety of reasons. But before you ask, consider why you’re asking, and how you frame the question.

Valentina Alto Valentina Alto
Updated on March 13, 2025

Understanding Ordinary Least Squares (OLS) Regression

Ordinary Least Squares (OLS) regression is a technique used in linear regression to minimize the sum of squared differences between observed and predicted values, and obtain a straight line as close as possible to your data points.

Image: Shutterstock / Built In
Sadrach Pierre Sadrach Pierre
Updated on March 12, 2025

How to Heapify a Tree in C++

A step-by-step tutorial on how to heapify data in a tree data structure, including visuals and example code.

Image: Shutterstock / Built In
Mae Rice Mae Rice
Updated on March 12, 2025

Machine Learning Career Path: How to Become a Machine Learning Engineer

We asked three machine learning engineers how they got into their roles and how newcomers can follow in their footsteps.

Shesh Narayan Gupta Shesh Narayan Gupta
Updated on March 11, 2025

How to Set Up and Optimize DeepSeek Locally

Our expert explains everything you need to know about installing DeepSeek locally on both Mac and PC. Learn more.

Image: Shutterstock / Built In
Carla Martins Carla Martins
Updated on March 10, 2025

Gaussian Naive Bayes Explained With Scikit-Learn

Gaussian Naive Bayes is a classification technique used in machine learning based on the probabilistic approach and Gaussian distribution. Here’s a deep dive on how to use it in Scikit-Learn.

Image: Shutterstock / Built In
KSV Muralidhar KSV Muralidhar
Updated on March 05, 2025

R-Squared and Adjusted R-Squared: Explained

Adjusted R-squared is a modified version of R-squared that adjusts for predictors that do not contribute to predictive accuracy in a regression model. It can be a reliable measure of goodness of fit for multiple regression problems.

Image: Shutterstock / Built In
Mike Thomas Mike Thomas
Updated on March 04, 2025

28 Machine Learning in Marketing Examples

These companies are fine-tuning their marketing efforts with machine learning.

Image: Shutterstock
Lynn Kwong Lynn Kwong
Updated on March 03, 2025

Asyncio in Python: A Guide

Asyncio is a Python library that allows us to write concurrent code using the async/await syntax. Learn how to use this library to write asynchronous code.

Image: Shutterstock / Built In

Related Topics