Big Data Articles

Sorted By: Most Recent
Alan Simon Alan Simon
Updated on March 15, 2023

What Is Data Warehousing? Understand the Importance of Data Structures and Architecture.

Data warehousing is the aggregation and storage of data in one place, enabling data-driven decision-making. Here’s what you need to know.

Image: Shutterstock / Built In
Alan Simon Alan Simon
Updated on March 15, 2023

What Is Data Modeling? Common Tools, Techniques and Model Types.

Data modeling is the process of mapping out how major pieces of data will relate to one another before creating an analytical model. Here’s what you need to know.

Image: Shutterstock / Built In
Artturi Jalli Artturi Jalli
Updated on March 15, 2023

What Is MariaDB?

MariaDB is a fast, scalable open-source community-supported relational database management system that’s also an enhanced version of MySQL.

Image: Shutterstock / Built In
Chris Dowsett Chris Dowsett
Updated on March 15, 2023

What Is a Data Pipeline?

A data pipeline is a series of data processing steps. A data pipeline might move a data set from one data storage location to another data storage location.

Image: Shutterstock / Built In
Kaycee Lai Kaycee Lai
Updated on February 09, 2023

What Is a Data Lake? Is It Right for Your Company?

A few questions to help determine if it’s the data architecture you really need.

Chris Dowsett Chris Dowsett
Updated on February 03, 2023

What Is MongoDB?

MongoDB is an open-source, document-oriented NoSQL database designed to handle large amounts of data and provide fast performance.

Image: Shutterstock / Built In
Sadrach Pierre Sadrach Pierre
Updated on November 14, 2022

How to Form Clusters in Python: Data Clustering Methods

Every data scientist should know how to form clusters in Python since it’s a key analytical technique in a number of industries. Here’s a guide to getting started.

Image: Shutterstock
Ying Wang Ying Wang
Updated on August 26, 2022

A Guide to Resolving Data Divergence in SQL

Data divergence, meaning differences in results generated from old and new versions of data architecture, results from a number of issues in the pipeline. Fortunately, a relatively straightforward method exists for resolving the problem.

Rahul Agarwal Rahul Agarwal
Updated on August 22, 2022

How Can Data Scientists Use Parallel Processing?

Get the most out of your machine with these techniques.

Przemek Chojecki Przemek Chojecki
Updated on August 17, 2022

How to Spot Deepfake Technology

The AI that powers deepfakes poses all sorts of questions for consumers of media. Using a couple of simple principles, though, you can develop a sophisticated understanding of what you’re looking at.

Related Topics