Computer Vision Articles

Sorted By: Most Recent
Chinmay Bhalerao Chinmay Bhalerao
Updated on August 19, 2024

Vision Transformer: An Introduction

A vision transformer is a type of neural network that can be used for image classification and other computer vision tasks. Here’s what you need to know.

Image: Shutterstock / Built In
Aleksandr Ahramovich Aleksandr Ahramovich
Updated on August 19, 2024

Top Applications for Computer Vision in Sports

The rapidly developing field of computer vision has a number of sports-related uses. Our expert explains some of the most interesting ones.

Image: Shutterstock / Built In
Pranoy Radhakrishnan Pranoy Radhakrishnan
Updated on August 19, 2024

A Guide to Image Captioning in Deep Learning

Image captioning is the process of using natural language processing and computer vision to generate captions from an image. Learn more about how it works. 

Image: Shutterstock / Built In
Nell Watson Nell Watson
Updated on August 12, 2024

Here’s How AI Is Building a Robot-Filled World

In her latest book, researcher and tech ethicist Nell Watson details how generative AI is fueling the advancement of robotics.

Image: Shutterstock / Built In
Juan D. Ramirez Juan D. Ramirez
Updated on August 12, 2024

GPT-4o: Here’s What You Need to Know

The newest version has enhanced response time, vision capabilities and text processing, plus a cleaner user interface.

Image: Koshiro K / Shutterstock / Built In
Young Entrepreneur Council Young Entrepreneur Council
Updated on April 22, 2024

8 Industries Poised to Benefit From Augmented and Virtual Reality

Members of Young Entrepreneur Council list the industries they think should make better use of augmented and virtual reality technology.

Mrinal Tyagi Mrinal Tyagi
Updated on April 02, 2024

Histogram of Oriented Gradients: An Overview

Histogram of oriented gradients (HOG) is a feature descriptor used in computer vision and image processing for object detection. Learn how it works.

Image: Shutterstock / Built In
Chinmay Bhalerao Chinmay Bhalerao
Updated on December 19, 2023

A Guide to Python Tesseract

Tesseract is an optical character recognition engine used to extract text from images, and it can be accessed in Python through the library pytesseract. Here’s what to know.

Image: Shutterstock / Built In
Jacob Biba Jacob Biba
Updated on December 14, 2023

What Is Machine Vision?

Machine vision helps robots see and recognize their surroundings so they can perform more complex tasks.

Image: Shutterstock / Built In
Chinmay Bhalerao Chinmay Bhalerao
Updated on November 06, 2023

A Deep Dive Into Non-Maximum Suppression (NMS)

Non-maximum suppression (NMS) is a post-processing technique that is used in object detection tasks to eliminate duplicate detections and select bounding boxes.

Image: Shutterstock / Built In