Big Data Articles

Abdishakur Hassan
Updated on March 08, 2023

What Is Cluster Analysis?

We use cluster analysis in machine learning, image analysis, data mining and pattern recognition. Here’s how it works and when you’ll find it most useful.

Kaycee Lai
Updated on February 09, 2023

What Is a Data Lake? Is It Right for Your Company?

A few questions to help determine if it’s the data architecture you really need.

Abdishakur Hassan
Updated on February 06, 2023

What Is Extrapolation?

Extrapolation is a statistical technique used in data science to estimate values of data points beyond the range of the known values in the data set.
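
As a rough illustration of the idea, the sketch below extends a straight line through the last two known points to estimate a value outside the observed range. The function name `linear_extrapolate` and the sample data are hypothetical, and linear extrapolation is only one of several possible approaches.

```python
def linear_extrapolate(xs, ys, x_new):
    """Estimate y at x_new by extending the line through the last two known points."""
    x0, y0 = xs[-2], ys[-2]
    x1, y1 = xs[-1], ys[-1]
    slope = (y1 - y0) / (x1 - x0)
    return y1 + slope * (x_new - x1)


# Hypothetical known data: x ranges from 1 to 4.
known_x = [1, 2, 3, 4]
known_y = [10.0, 12.0, 14.0, 16.0]

# Estimate a value at x = 6, which lies beyond the known range.
estimate = linear_extrapolate(known_x, known_y, 6)
print(estimate)
```

The further `x_new` lies from the known range, the less reliable an estimate like this is likely to be, since the sketch assumes the trend stays linear.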

Chris Dowsett
Updated on February 03, 2023

What Is MongoDB?

MongoDB is an open-source, document-oriented NoSQL database designed to handle large amounts of data and provide fast performance.

Jye Sawtell-Rickson
Updated on January 23, 2023

What Is the Fibonacci Sequence?

The Fibonacci sequence is a series of numbers in which each number equals the sum of the two that precede it: 0, 1, 1, 2, 3, 5, 8, 13, 21 and so on.
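
The rule above can be sketched in a few lines of Python. This is a minimal illustration, assuming the sequence starts at 0 and 1 as in the example; the function name `fibonacci` is a hypothetical choice.

```python
def fibonacci(n):
    """Return the first n Fibonacci numbers, starting from 0 and 1."""
    sequence = []
    a, b = 0, 1
    for _ in range(n):
        sequence.append(a)
        # Each new term is the sum of the two terms before it.
        a, b = b, a + b
    return sequence


print(fibonacci(9))  # [0, 1, 1, 2, 3, 5, 8, 13, 21]
```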

Jye Sawtell-Rickson
Updated on January 17, 2023

What Is PyTorch?

PyTorch is an open-source machine learning framework used for training deep neural networks.

Sadrach Pierre
Updated on November 14, 2022

How to Form Clusters in Python: Data Clustering Methods

Every data scientist should know how to form clusters in Python since it’s a key analytical technique in a number of industries. Here’s a guide to getting started.

Anthony Figueroa
Updated on October 25, 2022

Correlation Is Not Causation

Correlation occurs when two variables change at the same time, while causation is when a change in one variable causes the other to change. Here’s why you need to understand the difference.

Ying Wang
Updated on August 26, 2022

A Guide to Resolving Data Divergence in SQL

Data divergence, meaning differences between the results generated by old and new versions of a data architecture, can stem from a number of issues in the pipeline. Fortunately, a relatively straightforward method exists for resolving the problem.

Sara A. Metwalli
Updated on August 23, 2022

5 Ways to Learn Git and Version Control

Version control is hard to learn, but it doesn’t have to be! These five resources can help you get Git.
