Data Science Articles

Sohail Hosseini
Updated on August 04, 2025

How to Use Loc and iLoc in Pandas: A Guide

The .loc[] and .iloc[] indexers in pandas select specific rows and columns of a DataFrame, including slices. The .loc[] indexer selects by label, while the .iloc[] indexer selects by integer position.

Image: Shutterstock / Built In
David Klempfner
Updated on August 04, 2025

Two’s Complement: A Guide

Two’s complement is a binary encoding method used to represent signed integers in computing systems.
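As a minimal sketch of the encoding described above, the Python below converts a signed integer to a fixed-width two’s complement bit string and back. The function names and the 8-bit default width are assumptions chosen for illustration, not taken from the article.

```python
def to_twos_complement(value: int, bits: int = 8) -> str:
    """Return the two's complement bit string of a signed integer."""
    lowest, highest = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    if not lowest <= value <= highest:
        raise ValueError(f"{value} does not fit in {bits} bits")
    # Masking with 2**bits - 1 maps a negative value to 2**bits + value.
    return format(value & ((1 << bits) - 1), f"0{bits}b")


def from_twos_complement(bit_string: str) -> int:
    """Decode a two's complement bit string back to a signed integer."""
    bits = len(bit_string)
    raw = int(bit_string, 2)
    # A set sign bit means the encoded value is raw - 2**bits.
    return raw - (1 << bits) if raw >= 1 << (bits - 1) else raw


print(to_twos_complement(5))             # 00000101
print(to_twos_complement(-5))            # 11111011
print(from_twos_complement("11111011"))  # -5
```

With 8 bits the representable range is -128 to 127, which is why the sketch rejects values outside it.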

Giorgos Myrianthous
Updated on August 01, 2025

Fact Table vs. Dimension Table: What’s the Difference?

A fact table contains quantitative data from a business process. Dimension tables store qualitative data that provide context for the facts. Together, they are the core components of the star schema used in data warehouse modeling.
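The relationship between the two table types can be sketched with Python’s built-in sqlite3 module. The table names (fact_sales, dim_product), columns and sample rows are hypothetical and exist only to illustrate a tiny star schema with one fact table and one dimension table.

```python
import sqlite3


def revenue_by_category():
    """Build a tiny star schema in memory and aggregate facts by a dimension."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Dimension table: qualitative context for each product.
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY,
            name       TEXT,
            category   TEXT
        );
        -- Fact table: quantitative measures, keyed to the dimension.
        CREATE TABLE fact_sales (
            sale_id    INTEGER PRIMARY KEY,
            product_id INTEGER REFERENCES dim_product(product_id),
            quantity   INTEGER,
            revenue    REAL
        );
        INSERT INTO dim_product VALUES
            (1, 'Notebook', 'Stationery'),
            (2, 'Pen',      'Stationery'),
            (3, 'Mug',      'Kitchen');
        INSERT INTO fact_sales VALUES
            (1, 1, 2, 10.0),
            (2, 2, 5, 7.5),
            (3, 3, 1, 8.0),
            (4, 1, 1, 5.0);
    """)
    # Join facts to the dimension, then group the measures by category.
    rows = conn.execute("""
        SELECT d.category, SUM(f.quantity), SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS d ON d.product_id = f.product_id
        GROUP BY d.category
        ORDER BY d.category
    """).fetchall()
    conn.close()
    return rows


print(revenue_by_category())
# [('Kitchen', 1, 8.0), ('Stationery', 8, 22.5)]
```

The fact table holds only keys and numeric measures, while the descriptive attributes used for grouping live in the dimension table.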

Gianluca Malato
Updated on August 01, 2025

An Introduction to the Shapiro-Wilk Test for Normality

The Shapiro-Wilk test is a hypothesis test applied to a sample with the null hypothesis that the sample was generated from a normal distribution.

Alex Williams
Updated on August 01, 2025

What Is a Data Governance Framework?

A data governance framework is a set of rules and role assignments that ensure cooperation across a company’s data management.

Julia Zolotarev
Updated on August 01, 2025

What Is a Non-Relational Database?

A non-relational database (or NoSQL database) is a data storage system that allows flexible, schema-less organization of information using formats like documents, key-value pairs, graphs or wide-column structures.

Artem Oppermann
Updated on August 01, 2025

What Are Data Silos?

Data silos are isolated collections of data within an organization that are not easily accessible by other teams or systems, hindering collaboration and unified data analysis.

Sara A. Metwalli
Updated on July 31, 2025

What Is Data Labeling in Machine Learning?

Data labeling is the process of annotating raw data with meaningful tags or classifications so machine learning models can learn to recognize patterns and make accurate predictions.

Edoardo Romani
Updated on July 31, 2025

What Is Diagnostic Analytics?

Diagnostic analytics is the process of examining historical data to identify the root causes behind outcomes, trends or anomalies.

Artem Oppermann
Updated on July 31, 2025

What Is Object-Relational Mapping (ORM)?

Object-relational mapping (ORM) is a programming technique that lets developers use object-oriented code to interact with relational databases by translating objects to and from database-compatible formats.
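The translation step can be sketched by hand with Python’s standard library. This is not a real ORM library: the User class, the UserRepository class and the users table are hypothetical names, and the sketch only shows objects being turned into rows and rows back into objects.

```python
import sqlite3
from dataclasses import astuple, dataclass
from typing import Optional


@dataclass
class User:
    """Plain object the application code works with."""
    id: int
    name: str
    email: str


class UserRepository:
    """Hand-rolled mapping layer between User objects and a users table."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self.conn = conn
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS users "
            "(id INTEGER PRIMARY KEY, name TEXT, email TEXT)"
        )

    def save(self, user: User) -> None:
        # Object -> row: the dataclass fields become column values.
        self.conn.execute(
            "INSERT OR REPLACE INTO users VALUES (?, ?, ?)", astuple(user)
        )

    def get(self, user_id: int) -> Optional[User]:
        # Row -> object: the selected columns are passed to the constructor.
        row = self.conn.execute(
            "SELECT id, name, email FROM users WHERE id = ?", (user_id,)
        ).fetchone()
        return User(*row) if row else None


repo = UserRepository(sqlite3.connect(":memory:"))
repo.save(User(1, "Ada", "ada@example.com"))
print(repo.get(1))  # User(id=1, name='Ada', email='ada@example.com')
```

Full ORM libraries generate this kind of mapping code automatically, so the calling code never writes SQL directly.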
