Senior Machine Learning Engineer - AI-Assisted Data Annotation

ABBYY

Bangalore, Bengaluru Urban, Karnataka, IND

In-Office

Senior level

Artificial Intelligence • Software • Automation

The Role

The Senior Machine Learning Engineer will develop AI-assisted annotation pipelines, leverage large models for training data, and validate outputs for model training. Responsibilities include overseeing the automated annotation process, optimizing inference pipelines, and collaborating with various teams to enhance annotation quality and system performance.

Summary Generated by Built In

Join ABBYY and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth, while fueling ours.

Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing.

As a trusted partner for purpose-built AI and intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business. Over 10,000 customers trust ABBYY, including many Fortune 500 companies. You will help develop a portfolio that already includes clients such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK.

About the Role

We are seeking a Senior Machine Learning Engineer – AI-Assisted Data Annotation to own the automated annotation track within ABBYY’s Document AI Data team.

This role sits at the intersection of large model capabilities and production data engineering, leveraging LLMs and vision-language models to generate high-quality training data at scale. You will design and build AI-assisted annotation pipelines, ensuring outputs are accurate, measurable, and reliable for downstream model training.

This is an ideal role for engineers who combine deep model expertise with strong system-building instincts and thrive in fast-moving, experimental environments.

Key Responsibilities

Technical Development & Innovation

Design and implement AI-powered annotation pipelines using large models to generate ground truth labels at scale

Develop and refine prompting strategies, few-shot examples, and fine-tuning approaches to improve accuracy and consistency

Build systems for label verification, confidence scoring, and quality validation

Evaluate which tasks are suitable for automated annotation vs. human review, and define decision criteria

Create evaluation frameworks to benchmark automated annotations against human-labeled data

Continuously improve annotation quality using feedback from human review workflows

Project Ownership & Leadership

Own the automated annotation track end-to-end, from architecture through production monitoring

Drive technical decisions across model selection, pipeline design, and validation strategies

Define integration points with platform infrastructure and model serving systems

Collaborate with Data Operations to design human-in-the-loop workflows for efficient review

Contribute to roadmap planning with Principal-level technical leadership

Infrastructure & Scale

Build and optimize large-scale inference pipelines for processing millions of documents

Implement monitoring and alerting for quality degradation and system failures

Design batching, caching, and fallback mechanisms to balance cost, throughput, and accuracy

Collaborate with Platform teams on model serving, APIs, and infrastructure scaling

Maintain clear documentation of annotation strategies, metrics, and known limitations

Qualifications

Education & Experience

MS or PhD in Computer Science, Engineering, Mathematics, or related field

5+ years of experience in Machine Learning / AI, with a focus on Large Language Models (LLMs), Vision-Language Models (VLMs), and data annotation or labeling systems

Demonstrated success using large AI models to automate annotation at production scale

Strong background in evaluation design and quality measurement

Technical Expertise

Deep expertise in LLMs and VLMs, including prompting, instruction tuning, and output evaluation

Strong understanding of document understanding tasks (classification, extraction, layout analysis, semantic parsing)

Experience designing label quality metrics, confidence scoring, and agreement analysis

Strong programming skills in Python and proficiency with PyTorch or similar frameworks

Experience with large-scale inference pipelines and model serving systems

Familiarity with human-in-the-loop annotation systems and automation trade-offs

Leadership & Communication

Proven ability to independently own complex technical workstreams

Strong collaboration with data operations, platform, and modeling teams

Ability to clearly communicate quality trade-offs and system behavior to diverse stakeholders

Rigorous, data-driven problem-solving approach

Here are some of our local benefits:

Comprehensive medical, accidental, and life insurance

Weekly wellness sessions to support your physical and mental well-being

A generous paid time off policy

Join ABBYY, and you will:

Love how you work

We provide remote and hybrid working options to fit all lifestyles.
We use flexible hours across most of our teams to allow you to find your own definition of balance.
Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about.
To ensure your family is cared for, we offer paid parental leave in all our locations.

Love whom you work with

We are a global team of 600+ colleagues, spread across 15 countries on four continents.
With colleagues representing 30+ nationalities, our workforce reflects the world.
Innovation and excellence run through our veins. Our teams' expertise has earned ABBYY more than 140 technology patents.
We are guided by the values of respect, transparency, and simplicity.
"Team Environment" is in the top three highest-scoring drivers of engagement across all of our departments.

Love what you work on

We are a company with more than 35 years of experience in the technology market.
Over 10,000 customers trust ABBYY, including many Fortune 500 companies, with names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK.
We have modernized the capture market by creating the first low-code/no-code IDP platform.
Our machine learning, natural language processing, and computer vision technologies, together with a marketplace built with AI, can transform any document in any process.
Top analyst firms recognize ABBYY's market leadership, including Gartner, Everest PEAK Matrix® Assessment, ISG Intelligent Automation Lens, and NelsonHall, among others.

ABBYY is an Equal Employment Opportunity employer that values the strength that diversity brings to the workplace. To learn more about our commitment to Diversity and Inclusion, check out the careers section on our website.

ABBYY Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about ABBYY and has not been reviewed or approved by ABBYY.

Flexible Benefits — Remote and hybrid options with flexible hours are positioned as core elements of the package and appear consistently in current job materials. This flexibility extends across locations where feasible.
Healthcare Strength — Health coverage includes medical, dental, and vision with an Employee Assistance Program, and is characterized as comprehensive with low-premium options in current postings. These elements indicate strong baseline healthcare support.
Leave & Time Off Breadth — The offering includes PTO, a large set of paid company holidays with floating holidays, paid parental leave, and dedicated volunteer days. This breadth supports time away for family, rest, and community.

The Company

HQ: Austin, Texas

923 Employees

Year Founded: 1989

What We Do

ABBYY puts your information to work. We help enterprises and organizations transform their data into intelligent, actionable outcomes, so they can make smart decisions faster and drive better results. Our intelligent automation solutions employ AI that is purpose-built for the enterprise, created with our customers in mind and supporting over 200 languages in real time. ABBYY intelligent document processing transforms data from any document, in any format or language, at any time, into data that drives processes and decision-making. ABBYY Process Intelligence delivers process-related insights and monitoring to improve business process execution. For more information, please visit the ABBYY website.