What Is Machine Learning?

Machine learning is a subset of artificial intelligence that gives systems the ability to learn and optimize processes without having to be consistently programmed. Simply put, machine learning uses data, statistics and trial and error to “learn” a specific task without ever having to be specifically coded for the task.

What Is Machine Learning?

Machine learning is a subfield of artificial intelligence in which systems “learn” through data, statistics and trial and error to optimize processes and innovate at quicker rates. Through machine learning, computers can apply human-like reasoning and decision-making to help solve some of the world’s toughest problems, ranging from cancer research to climate change.

Most computer programs rely on code to tell them what to execute or what information to retain. This is known as explicit knowledge, which contains anything that is easily written or recorded like textbooks, videos and manuals. With machine learning, computers gain tacit knowledge, or knowledge gained from personal experience and context. This type of knowledge is hard to transfer from one person to the next via written or verbal communication.

Facial recognition is a type of tacit knowledge. We recognize a person’s face, but it can be difficult for us to describe how or why we recognize it. We rely on our personal knowledge banks to connect the dots and immediately recognize a person based on their face. Another example is riding a bike: It’s much easier to show someone how to ride a bike than it is to explain it.

Computers no longer have to rely on billions of lines of code to carry out calculations. Machine learning gives computers the power of tacit knowledge to make connections, discover patterns and make predictions based on what they learned in the past. Machine learning’s use of tacit knowledge has made it a go-to technology for almost every industry from fintech to government.

Machine Learning vs. Deep Learning

Deep learning is a subfield within machine learning, and it’s gaining traction for its ability to extract features from data. Deep learning uses artificial neural networks (ANNs) to extract higher-level features from raw data. ANNs, though much different from human brains, were inspired by the way humans biologically process information. The learning a computer does is considered “deep” because the networks use layering to learn from and interpret raw information.

For example, deep learning is an important asset for image processing in everything from e-commerce to medical imagery. Google is equipping its programs with deep learning to discover patterns in images to display the correct image for whatever you search. If you search for a winter jacket, Google’s machine and deep learning will team up to discover patterns in images — sizes, colors, shapes, relevant brand titles — that display relevant jackets.

Deep learning is also making significant advancements in radiology, pathology and any medical sector that relies heavily on imagery. The technology leverages tacit knowledge — from studying millions of other scans — to immediately recognize disease or injury, saving doctors and hospitals both time and money.

How Does Machine Learning Work?

Machine learning compiles input data, which can be data gathered from training sessions or other sources, such as data set search engines, .gov websites and open data registries like that of Amazon Web Services. This data serves the same function that prior experiences do for humans, giving machine learning models historical information to work with when making future determinations.

Algorithms then analyze this data, searching for patterns and trends that allow them to make accurate predictions. In this way, machine learning can glean insights from the past to anticipate future happenings. Typically, the larger the data set that a team can feed to machine learning software, the more accurate the predictions.

The idea is that machine learning algorithms should be able to perform these tasks on their own, requiring minimal human intervention. This speeds up various processes as machine learning comes to automate many aspects of different industries.

Types of Machine Learning

There is a range of machine learning types that vary based on several factors like data size and diversity. Below are a few of the most common types of machine learning under which popular machine learning algorithms can be categorized.

Supervised Learning

Supervised learning involves mathematical models of data that contain both input and output information. Machine learning computer programs are constantly fed these models, so the programs can eventually predict outputs based on a new set of inputs.

Regression and classification are two of the more popular analyses under supervised learning. Regression analysis is used to discover and predict relationships between outcome variables and one or more independent variables. Commonly known as linear regression, this method provides training data to help systems with predicting and forecasting. Classification is used to train systems to identify an object and place it in a sub-category. For instance, email filters use machine learning to automate incoming email flows for primary, promotion and spam inboxes.

Unsupervised Learning

Unsupervised learning involves working with data that contains only inputs, and then uncovering patterns or structures within the data by organizing it into similar clusters or groups. The method learns from previous test data that hasn’t been labeled or categorized and will then group the raw data based on commonalities. Cluster analysis uses unsupervised learning to sort through giant lakes of raw data and group certain data points together. Clustering is a popular tool for data mining, and it is used in everything from genetic research to creating virtual social media communities of like-minded individuals.

Semi-Supervised Learning

Semi-supervised learning falls in between unsupervised and supervised learning. With this technique, programs are fed a mixture of labeled and unlabeled data that not only speeds up the machine learning process, but helps machines identify objects and learn with increased accuracy.

Typically, programmers introduce a small amount of labeled data with a large percentage of unlabeled information, and the computer will have to use the groups of structured data to cluster the rest of the information. Labeling supervised data is seen as a massive undertaking because of high costs and hundreds of hours spent.

Machine Learning Algorithms

While there are many machine learning algorithms available for solving different problems, these are some of the most common types.

Neural Networks

Neural networks function similarly to the human brain, analyzing data to detect patterns and relationships. This process allows machines to learn from their mistakes and improve their performance over time.

Linear Regression

Linear regression is used to develop accurate predictions when the outputs are continuous. It aims to represent the relationship between a dependent variable and at least one independent variable with a straight line.

Logistic Regression

Logistic regression is used to make predictions for binary classification problems — where the outputs belong to one category or another (true/false, yes/no, etc.). Common examples include spam filtering and determining whether a patient has a specific disease.

Decision Trees

Decision trees evaluate data sets by asking a series of questions that can be organized into the branches of a tree diagram. As a result, decision trees can be used to both predict the value of an outcome and sort data into categories.

Random Forest

Random forest solves predictive modeling and classification problems by combining the results of many decision trees. This approach increases the accuracy of a model and accounts for common problems like overfitting.

Naive Bayes

Naive Bayes works under the assumption that features are independent, meaning one feature doesn’t affect another — each feature has an equal chance of impacting the outcome. This lets Naive Bayes break down large data sets into manageable calculations.

K-Nearest Neighbors (KNN)

K-nearest neighbors (KNN) classifies data points based on their proximity to other data points. It assumes that similar data points are close to each other, calculating the distance between a value and nearby data points and assigning it to the most prevalent category.

K-means Clustering

K-means organizes unlabeled data into clusters by grouping nearby data points together. It accomplishes this by selecting a center point — real or imagined — for each cluster and sorting data points based on the center point they’re closest to.

Machine Learning Examples and Applications

Financial Services

The financial services industry is championing machine learning for its unique ability to speed up processes with a high rate of accuracy. What has taken humans hours, days or even weeks to accomplish can now be executed in minutes. Cards issued in the U.S. made up over 42 percent of fraud losses globally, according to a Nilson report. Ensuring these transactions are more secure, Visa has embraced AI and machine learning to detect fraud and other digital threats.

Additionally, machine learning is used by lending and credit card companies to manage and predict risk. These computer programs take into account a loan seeker’s past credit history, along with thousands of other data points like cell phone and rent payments, to deem the risk of the lending company. By taking other data points into account, lenders can offer loans to a much wider array of individuals who couldn’t get loans with traditional methods.

Trading firms are using machine learning to amass a huge lake of data and determine the optimal price points to execute trades. These complex high-frequency trading algorithms take thousands, if not millions, of financial data points into account to buy and sell shares at the right moment.

Healthcare

The healthcare industry uses machine learning to manage medical information, discover new treatments and even detect and predict disease. With machine learning computer systems, medical professionals can easily view patient medical records without having to dig through files or have chains of communication with other areas of the hospital. Updated medical systems can now pull up pertinent health information on each patient in the blink of an eye.

AI and machine learning can automate maintaining health records, following up with patients and authorizing insurance — tasks that unnecessarily take up vital healthcare resources for 68 percent of physicians, according to the American Medical Association.

Machine learning-enabled AI tools are working alongside drug developers to generate drug treatments at faster rates. These machine learning tools are fed millions of data points and configure them to help researchers view which compounds are successful and which aren’t. Instead of spending millions of human hours on each trial, machine learning technologies can produce successful drug compounds in weeks or months.

Radiology and pathology departments all over the world use machine learning to analyze CT and X-ray scans and find diseases. After being fed thousands of images of diseases through a mixture of supervised, unsupervised or semi-supervised models, advanced machine learning systems can diagnose diseases at higher rates than humans. Machine learning has also been used to predict deadly viruses, like Ebola and Malaria, and is used by the CDC to track instances of the flu virus every year.

Social Media

Social media companies leverage machine learning to create a sense of community, as well as weed out bad actors and disinformation. Machine learning fosters the former by looking at pages, tweets, topics and other features that an individual likes and suggesting other topics or community pages based on those likes. It’s essentially using your preferences as a way to power a social media recommendation engine.

The spread of misinformation in politics has prompted social media companies to use machine learning to quickly identify harmful patterns of false information, flag malicious bots, view reported content and delete when necessary.

Retail and E-commerce

The retail industry relies on machine learning for its ability to optimize sales and gather data on individualized shopping preferences. Machine learning offers retailers and online stores the ability to make purchase suggestions based on a user’s clicks, likes and past purchases. Once customers feel like retailers understand their needs, they are more likely to stay with that company and purchase more items.

Visual search is becoming a huge part of the shopping experience, too. Instead of typing in queries, customers can now upload an image to show the computer exactly what they’re looking for. Machine learning will analyze the image and produce search results based on its findings.

Machine learning has also been an asset in predicting customer trends and behaviors. These machines look holistically at individual purchases to determine what types of items are selling and what items will be selling in the future. For example, maybe a new food has been deemed a “super food.” A grocery store’s systems might identify increased purchases of that product and send customers coupons or targeted advertisements for all variations of that item. Additionally, a system could look at individual purchases to send you future coupons.

History of Machine Learning

The brief timeline below tracks the development of machine learning from its beginnings in the 1950s to its maturation during the twenty-first century.

1950

Alan Turing invents the Turing Test, which consists of three terminals — a computer-operated one and two human-operated ones. The goal is for the computer to trick a human interviewer into thinking it is also human by mimicking human responses to questions. The test sparks debate around whether computers possess artificial intelligence.

1956

Arthur Samuel publicly reveals a computer that can determine the optimal moves to make in a checker match.

1957

Frank Rosenblatt creates the first neural network for computers, known as the perceptron. This invention enables computers to reproduce human ways of thinking, forming original ideas on their own.

1962

Samuel builds on previous versions of his checkers program, leading to an advanced system made for the IBM 7094 computer. That same year, the computer defeats checkers master Robert Nealy in a match.

1979

Descending from a line of robots designed for lunar missions, the Stanford cart emerges in an autonomous format. The machine relies on 3D vision and pauses after each meter of movement to process its surroundings. Without any human help, this robot successfully navigates a chair-filled room to cover 20 meters in five hours.

1981

Gerald Dejong explores the concept of explanation-based learning (EBL). This approach involves providing a computer with training data, which it analyzes to develop a rule for filtering out unnecessary information. The idea is that this data is to a computer what prior experience is to a human being.

1985

Researcher Terry Sejnowksi creates an artificial neural network of 300 neurons and 18,000 synapses. Called NetTalk, the program babbles like a baby when receiving a list of English words, but can more clearly pronounce thousands of words with long-term training.

1990s

The 1990s marks a shift in the realm of machine learning. Scientists focus less on knowledge and more on data, building computers that can glean insights from larger data sets.

1997

Scientists at IBM develop a computer called Deep Blue that excels at making chess calculations. The program defeats world chess champion Garry Kasparov over a six-match showdown.

2006

The term “deep learning” is coined by Geoffrey Hinton, a long-time computer scientist and researcher in the field of AI. He applies the term to the algorithms that enable computers to recognize specific objects when analyzing text and images.

2010

Microsoft releases a motion-sensing device called Kinect for the Xbox 360. The device contains cameras and sensors that allow it to recognize faces, voices and movements. As a result, Kinect removes the need for physical controllers since players become the controllers.

2011

IBM’s Watson competes on Jeopardy! against two of the show’s most decorated champions. That same year, Google develops Google Brain, which earns a reputation for the categorization capabilities of its deep neural networks.

2012

Computer scientists at Google’s X lab design an artificial brain featuring a neural network of 16,000 computer processors. The network applies a machine learning algorithm to scan YouTube videos on its own, picking out the ones that contain content related to cats.

2014

Facebook unveils its new face recognition tool DeepFace. Composed of a deep network of millions of data points, DeepFace leverages 3D face modeling to recognize faces in images in a way very similar to that of humans.

2017

Google’s AI algorithm AlphaGo specializes in the complex Chinese board game Go. The algorithm achieves a close victory against the game’s top player Ke Jie in 2017. This win comes a year after AlphaGo defeated grandmaster Lee Sedol, taking four out of the five games.

2018

OpenAI releases GPT-1 and Google releases BERT, two language models based on transformer networks. These models signal the rise of language models, paving the way for large language models to take on increased importance in machine learning research.

2019

Microsoft introduces the Turing Natural Language Generation model, which contains 17 billion parameters. Google also releases a family of convolutional neural networks called EfficientNets, which perform just as well as larger models while remaining compact.

2020

OpenAI releases GPT-3, which contains 175 billion parameters. Combining natural language processing and machine learning, GPT-3 displays far more advanced abilities than GPT-2 in understanding human language and generating human-like text.

2021

OpenAI launches DALL-E, a multimodal tool that can produce images based on text prompts. However, it doesn’t become popular until the 2022 release of DALL-E 2.

2022

Google DeepMind reveals AlphaTensor, a system meant to speed up the process of building optimal algorithms that can handle complex tasks. In addition, OpenAI releases ChatGPT, a chatbot that offers a more conversational form of generative AI for users.

2023

OpenAI releases GPT-4 and Anthropic releases Claude AI, both of which can process and produce different data like text, images and audio. These tools lead efforts to make multimodal AI more accessible and commonplace.

2024

Google releases a family of multimodal models called Gemini, along with a chatbot by the same name, which was formerly known as Bard. These models come in various sizes and with different capabilities, and are being incorporated into several Google products, including Gmail, Docs and its search engine.

Frequently Asked Questions

What is machine learning in simple terms?

Machine learning is a type of artificial intelligence that focuses on helping computers learn how to complete tasks they haven’t been programmed for. Similar to how humans learn from experience, machine learning-powered computers gather insights from completing tasks and analyzing data and apply what they’ve learned to master new tasks.

What is the difference between AI and ML?

Artificial intelligence (AI) is a broad field that refers to the ability of a machine to complete tasks that typically require human intelligence. Machine learning (ML) is a subfield of artificial intelligence that specifically refers to machines that can complete tasks that require human intelligence without being explicitly programmed to do so.

What are the types of machine learning?

The three main types of machine learning are supervised, unsupervised and semi-supervised learning.

What are examples of machine learning?

Examples of machine learning include pattern recognition, image recognition, linear regression and cluster analysis.

Where is ML used in real life?

Real-world applications of machine learning include emails that automatically filter out spam, facial recognition features that secure smartphones, algorithms that help credit card companies detect fraud and computer systems that assist healthcare professionals in diagnosing diseases.

What Is Machine Learning?

Machine Learning vs. Deep Learning

How Does Machine Learning Work?

Types of Machine Learning

Supervised Learning

Unsupervised Learning

Semi-Supervised Learning

Machine Learning Algorithms

Neural Networks

Linear Regression

Logistic Regression

Decision Trees

Random Forest

Naive Bayes

K-Nearest Neighbors (KNN)

K-means Clustering

Machine Learning Examples and Applications

Financial Services

Healthcare

Social Media

Retail and E-commerce

History of Machine Learning

1950

1956

1957

1962

1979

1981

1985

1990s

1997

2006

2010

2011

2012

2014

2017

2018

2019

2020

2021

2022

2023

2024

Frequently Asked Questions

What is machine learning in simple terms?

What is the difference between AI and ML?

What are the types of machine learning?

What are examples of machine learning?

Where is ML used in real life?

Recent Artificial Intelligence Articles