Data Science Articles

Sorted By: Most Recent
Mrinal Tyagi Mrinal Tyagi
Updated on April 02, 2024

Histogram of Oriented Gradients: An Overview

Histogram of oriented gradients (HOG) is a feature descriptor used in computer vision and image processing for object detection. Learn how it works.

Image: Shutterstock / Built In
Srik Gorthy Srik Gorthy
Updated on April 01, 2024

Euclidean Distance Explained

Euclidean distance measures the length of the shortest line between two points. It’s commonly used in machine learning algorithms. Learn how to calculate it in Python.

Image: Shutterstock / Built In
Rahul Agarwal Rahul Agarwal
Updated on March 29, 2024

ROC Curves and AUC: The Ultimate Guide

ROC curves are one of the most common evaluation metrics for checking a classification model’s performance. This guide will help you to truly understand how ROC curves and AUC work together.

Image: Shutterstock / Built In
Asim Rahal Asim Rahal
Updated on March 20, 2024

Don’t Be Compliance-First. Be Risk-First Instead.

A risk-first approach focuses on a company’s most significant compliance risks rather than an arbitrary list of regulations.

Image: Shutterstock / Built In
Michael Galarnyk Michael Galarnyk
Updated on March 14, 2024

How to Use the Z-Table

A z-table, or standard normal table, reveals what percentage of values fall below a certain z-score in a normal distribution. Here’s how to use one and create your own.

Image: Shutterstock / Built In
Joel Hereth Joel Hereth
Updated on March 12, 2024

How to Optimize SQL for Large Data Sets

Optimizing SQL for large data sets is an important step in managing the performance of your database. Follow these best practices to achieve faster data retrieval and efficiency. 

Image: Shutterstock / Built In
Rohit Thakur Rohit Thakur
Updated on March 12, 2024

Beginner’s Guide to VGG16 Implementation in Keras

VGG16 is a convolutional neural net architecture that’s used for image recognition. It utilizes 16 layers with weights and is considered one of the best vision model architectures to date.

Image: Shutterstock / Built In
Frank H. McCourt, Jr. and Michael J. Casey Frank H. McCourt, Jr. and Michael J. Casey
Updated on March 12, 2024

What If We Lived in an Open Data Economy?

In an excerpt from their latest book, our experts imagine a world where we’re in control of our own online experience.

Okan Yenigün Okan Yenigün
Updated on March 11, 2024

DBSCAN Clustering Algorithm Demystified

Density-based spatial clustering of applications with noise (DBSCAN) is a clustering algorithm used to define clusters in a data set and identify outliers. Here’s how it works. 

Image: Shutterstock / Built In
Sachin Dev Sachin Dev
Updated on March 07, 2024

Understanding Overfitting vs. Underfitting in Machine Learning

Overfitting and underfitting are two problems that can occur when building a machine learning model and can lead to poor performance. Learn what causes them and how to fix it.

Image: Shutterstock / Built In

Related Topics