Data Science Articles

Rahul Agarwal
Updated on March 29, 2024

ROC Curves and AUC: The Ultimate Guide

ROC curves are one of the most common evaluation metrics for checking a classification model’s performance. This guide will help you to truly understand how ROC curves and AUC work together.

Image: Shutterstock / Built In
Asim Rahal
Updated on March 20, 2024

Don’t Be Compliance-First. Be Risk-First Instead.

A risk-first approach focuses on a company’s most significant compliance risks rather than an arbitrary list of regulations.

Michael Galarnyk
Updated on March 14, 2024

How to Use the Z-Table

A z-table, or standard normal table, reveals what percentage of values fall below a certain z-score in a normal distribution. Here’s how to use one and create your own.

Joel Hereth
Updated on March 12, 2024

How to Optimize SQL for Large Data Sets

Optimizing SQL for large data sets is an important step in managing the performance of your database. Follow these best practices to achieve faster data retrieval and efficiency. 

Rohit Thakur
Updated on March 12, 2024

Beginner’s Guide to VGG16 Implementation in Keras

VGG16 is a convolutional neural net architecture that’s used for image recognition. It utilizes 16 layers with weights and is considered one of the best vision model architectures to date.

Frank H. McCourt, Jr. and Michael J. Casey
Updated on March 12, 2024

What If We Lived in an Open Data Economy?

In an excerpt from their latest book, our experts imagine a world where we’re in control of our own online experience.

Okan Yenigün
Updated on March 11, 2024

DBSCAN Clustering Algorithm Demystified

Density-based spatial clustering of applications with noise (DBSCAN) is a clustering algorithm used to define clusters in a data set and identify outliers. Here’s how it works. 

Sachin Dev
Updated on March 07, 2024

Understanding Overfitting vs. Underfitting in Machine Learning

Overfitting and underfitting are two problems that can occur when building a machine learning model and can lead to poor performance. Learn what causes them and how to fix them.

Bushra Anjum & Ken Hu
Updated on March 05, 2024

How to Build an Agile Framework for Machine Learning Projects

Machine learning projects can get unwieldy without a clear framework. Our experts explain how they built one based on the Agile methodology.

Stephen Adesina
Updated on March 04, 2024

Python Circular Import Error Solved

A Python circular import error occurs when two or more modules depend on each other and one tries to import from another before that module has finished loading. Here’s how to solve it.

