Big Data Articles

Aleksandras Šulženko
Updated on June 14, 2023

How Governments Can Use Alternative Data for Policymaking

The increasing digitization of society means a wealth of public data exists beyond traditional sources. Our expert argues that governments can use this data to make faster, more flexible policy decisions.

Image: Shutterstock / Built In
Sadrach Pierre
Updated on May 15, 2023

Streamlit Tutorial: A Beginner’s Guide to Building Machine Learning-Based Web Applications in Python

Analytics dashboards are a great way for data scientists to communicate insights to companies, but they can often be expensive and time-consuming to build. Streamlit is an easy-to-use library for Python that simplifies the process.

Erdem İŞBİLEN
Updated on April 12, 2023

What Is Maximum Likelihood Estimation (MLE)?

In statistics, we can use maximum likelihood estimation (MLE) to estimate the parameters of models. Here’s how MLE works.

Artem Oppermann
Updated on April 06, 2023

What Is CatBoost?

CatBoost is a machine learning gradient-boosting algorithm that’s particularly effective for handling data sets with categorical features. Our expert explains how CatBoost works and why it’s so effective.

Alex Williams
Updated on March 16, 2023

What Is a Knowledge Graph? Examples, Uses and More.

Knowledge graphs are becoming increasingly common thanks to their wide range of applications across industries. This guide introduces you to their basic principles and some examples.

Artturi Jalli
Updated on March 15, 2023

What Is Cassandra?

Cassandra is an open-source, scalable, high-performance NoSQL database.

Artturi Jalli
Updated on March 15, 2023

What Is MariaDB?

MariaDB is a fast, scalable, open-source, community-supported relational database management system that was developed as an enhanced fork of MySQL.

Chris Dowsett
Updated on March 15, 2023

What Is a Data Pipeline?

A data pipeline is a series of data processing steps. For example, a pipeline might move a data set from one storage location to another.

Artturi Jalli
Updated on March 15, 2023

What Is HBase?

HBase is a non-relational database management system for real-time data processing that runs on top of the Hadoop Distributed File System (HDFS).

Julia Zolotarev
Updated on March 15, 2023

What Is a Non-Relational Database?

Non-relational databases (NoSQL databases) are data stores that either are schema-free or have relaxed schemas that allow for changes in the data structure.
