Senior Machine Learning Engineer - AI-Assisted Data Annotation

Posted Yesterday
Be an Early Applicant
Bangalore, Bengaluru Urban, Karnataka, IND
In-Office
Senior level
Artificial Intelligence • Software • Automation
The Role
The Senior Machine Learning Engineer will develop AI-assisted annotation pipelines, leverage large models for training data, and validate outputs for model training. Responsibilities include overseeing the automated annotation process, optimizing inference pipelines, and collaborating with various teams to enhance annotation quality and system performance.
Summary Generated by Built In

Join ABBYY and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth, while fueling ours.

Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing.

As a trusted partner for purpose-built AI and intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business. Over 10,000 customers trust ABBYY, including many Fortune 500 ones. You will work on further developing a portfolio already containing client names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK.

About the Role 

We are seeking a Senior Machine Learning Engineer – AI-Assisted Data Annotation to own the automated annotation track within ABBYY’s Document AI Data team. 

This role sits at the intersection of large model capabilities and production data engineering, leveraging LLMs and vision-language models to generate high-quality training data at scale. You will design and build AI-assisted annotation pipelines, ensuring outputs are accurate, measurable, and reliable for downstream model training. 

This is an ideal role for engineers who combine deep model expertise with strong system-building instincts and thrive in fast-moving, experimental environments. 

Key Responsibilities 

Technical Development & Innovation 

  • Design and implement AI-powered annotation pipelines using large models to generate ground truth labels at scale 
  • Develop and refine prompting strategies, few-shot examples, and fine-tuning approaches to improve accuracy and consistency 
  • Build systems for label verification, confidence scoring, and quality validation 
  • Evaluate which tasks are suitable for automated annotation vs. human review, and define decision criteria 
  • Create evaluation frameworks to benchmark automated annotations against human-labeled data 
  • Continuously improve annotation quality using feedback from human review workflows 

Project Ownership & Leadership 

  • Own the automated annotation track end-to-end, from architecture through production monitoring 
  • Drive technical decisions across model selection, pipeline design, and validation strategies 
  • Define integration points with platform infrastructure and model serving systems 
  • Collaborate with Data Operations to design human-in-the-loop workflows for efficient review 
  • Contribute to roadmap planning with Principal-level technical leadership 

Infrastructure & Scale 

  • Build and optimize large-scale inference pipelines for processing millions of documents 
  • Implement monitoring and alerting for quality degradation and system failures 
  • Design batching, caching, and fallback mechanisms to balance cost, throughput, and accuracy 
  • Collaborate with Platform teams on model serving, APIs, and infrastructure scaling 
  • Maintain clear documentation of annotation strategies, metrics, and known limitations 

Qualifications 

Education & Experience 

  • MS or PhD in Computer Science, Engineering, Mathematics, or related field 
  • 5+ years of experience in Machine Learning / AI, with focus on:  
  • Large Language Models (LLMs) 
  • Vision-Language Models (VLMs) 
  • Data annotation or labeling systems 
  • Demonstrated success using large AI models to automate annotation at production scale 
  • Strong background in evaluation design and quality measurement 

Technical Expertise 

  • Deep expertise in LLMs and VLMs, including prompting, instruction tuning, and output evaluation 
  • Strong understanding of document understanding tasks (classification, extraction, layout analysis, semantic parsing) 
  • Experience designing label quality metrics, confidence scoring, and agreement analysis 
  • Strong programming skills in Python and proficiency with PyTorch or similar frameworks 
  • Experience with large-scale inference pipelines and model serving systems 
  • Familiarity with human-in-the-loop annotation systems and automation trade-offs 

Leadership & Communication 

  • Proven ability to independently own complex technical workstreams 
  • Strong collaboration with data operations, platform, and modeling teams 
  • Ability to clearly communicate quality trade-offs and system behavior to diverse stakeholders 
  • Rigorous, data-driven problem-solving approach 

Here are some of our local benefits: 

  • Comprehensive medical, accidental, and life insurance 
  • Weekly wellness sessions to support your physical and mental well-being 
  • A generous paid time off policy 

 

 

Join ABBYY, and you will:

Love how you work

  • We provide remote and hybrid working options to fit all lifestyles.
  • We use flexible hours across most of our teams to allow you to find your own definition of balance.
  • Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about.
  • To ensure your family is cared for, we offer paid parental leave in all our locations.

Love whom you work with

  • We are a global team of 600+ colleagues, spread across 15 countries on four continents.
  • With colleagues representing 30+ nationalities, our workforce reflects the world.
  • Innovation and excellence run through our veins. Our teams gather the expertise which has garnered ABBYY more than 140 technology patents.
  • We are guided by the values of respect, transparency, and simplicity.
  • "Team Environment" is in the top three highest-scoring drivers of engagement across all of our departments.

Love what you work on

  • We are a company with more than 35 years of experience in the technology market;
  • Over 10,000 customers trust ABBYY, including many Fortune 500 ones, with names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK;
  • We have modernized the capture market by creating the first low-code/no-code IDP platform.
  • Our Machine Learning, Natural Language Processing, Computer Vision Technologies, and a marketplace built with AI, can transform any document in any process;
  • Top Analyst firms recognize ABBYY's market leadership, including Gartner, Everest PEAK Matrix ® Assessment, ISG Intelligent Automation Lens, and NelsonHall, amongst others.

ABBYY is an Equal Employment Opportunity employer that values the strength that diversity brings to the workplace. To learn more about our commitment to Diversity and Inclusion, check out the careers section on our website.

Skills Required

  • MS or PhD in Computer Science, Engineering, Mathematics, or related field
  • 5+ years of experience in Machine Learning / AI
  • Deep expertise in Large Language Models (LLMs) and Vision-Language Models (VLMs)
  • Strong programming skills in Python
  • Experience with large-scale inference pipelines and model serving systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Austin, Texas
923 Employees
Year Founded: 1989

What We Do

ABBYY puts your information to work. We help enterprises and organizations to transform their data into intelligent, actionable outcomes, so they can make smart decisions faster and drive better results. Our intelligent automation solutions employ AI that is purpose-built for the enterprise—created with our customers in mind, supporting over 200 languages in real time. ABBYY intelligent document processing transforms data from any document, in any format or language, any time, into data that drives processes and decision-making. ABBYY Process Intelligence delivers process-related insights and monitoring to improve business process execution. For more information, please visit the ABBYY website

Similar Jobs

Atlassian Logo Atlassian

Senior Data Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
Bengaluru, Bengaluru Urban, Karnataka, IND
11000 Employees

CSC Logo CSC

Assistant Manager - Tax Accounting

Fintech • Legal Tech • Software • Financial Services • Cybersecurity • Data Privacy
Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
8500 Employees

Micron Technology Logo Micron Technology

Staff Engineer

Artificial Intelligence • Hardware • Information Technology • Machine Learning
In-Office
Bengaluru, Bengaluru Urban, Karnataka, IND
45000 Employees

TransUnion Logo TransUnion

Consultant

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Hybrid
2 Locations
13000 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account