Senior Data Quality Engineer, ML (India)

Posted 6 Days Ago
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka
In-Office
Senior level
Agency • Software • Consulting
Living at the intersection of creativity and technology.
The Role
As a Senior Data Quality Engineer for ML, you will evaluate outputs from large language models using Python and SQL, implement quality metrics, and automate evaluation pipelines, while collaborating with multidisciplinary teams to ensure software quality.
Summary Generated by Built In

Our AI/ML engineering team ensures Code and Theory delivers innovative, immersive web experiences that delight our clients and their customers. We are always striving to balance the demanding nature of working on cutting-edge technologies with the real-world demands of high performance, high security, and accessibility. Working in collaboration with our multi-disciplinary engineering, design, and quality assurance teams, you will build software that solves real-world problems for incredible clients. 

WHAT YOU’LL DO

  • Write Python and SQL scripts to evaluate outputs from large language models (LLMs)
  • Design and implement LLM-as-Judge evaluations with clear scoring rubrics (faithfulness, relevance, completeness, correctness)
  • Define and calculate quality metrics such as exact match, token-level F1, ROUGE, and subjective rubric scores
  • Build and maintain ground-truth datasets for benchmarking and regression testing
  • Automate evaluation pipelines and integrate them into CI/CD workflows
  • Conduct in-depth analysis of large unstructured datasets to identify inconsistencies, anomalies, missing values, and potential biases
  • Diagnose and report failure modes (hallucinations, irrelevant answers, formatting errors)
  • Collaborate and serve as a crucial link between AI engineers, QA, data scientists and product managers to set quality standards and release criteria
  • Document processes and maintain reproducibility of evaluation runs
  • Create comprehensive technical documentation, including design specifications, architecture diagrams, and code comments

WHAT YOU’LL NEED

  • 6-8 years of experience in data quality engineering, with hands-on expertise in Python, SQL, automated evaluation pipelines, LLM quality metrics, and end-to-end data validation across complex datasets and AI/ML systems
  • Strong proficiency in Python and SQL (data handling, scripting, test automation)
  • Experience with data cleaning and standardization techniques to facilitate ingestion and analysis by various teams
  • Understanding of generative AI concepts (prompts, hallucinations, grounding)
  • Experience designing structured LLM prompts for evaluations
  • Familiarity with at least one evaluation framework (RAGAS, DeepEval, TruLens, LangSmith) or ability to learn quickly
  • Familiarity with cloud runs and automation (GCP preferred) or ability to learn quickly
  • Ability to translate ambiguous quality expectations into measurable metrics
  • Excellent problem-solving abilities and analytical thinking
  • Effective communication skills to collaborate with cross-functional teams and present technical concepts to both technical and non-technical stakeholders

ABOUT US

Born in 2001, Code and Theory is a digital-first creative agency that sits at the center of creativity and technology. We pride ourselves on not only solving consumer and business problems, but also helping to establish new capabilities for our clients. With a global client roster of Fortune 100s and start-ups alike, we crave the hardest problems to solve. We have teams distributed across North America, South America, Europe, and Asia. The Code and Theory global network of agencies is growing and includes Kettle, Instrument, Left Field Labs, Create Group, Mediacurrent, Rhythm, and TrueLogic.

Striving never to be pigeonholed, we work across every major category: from tech to CPG, financial services to travel & hospitality, government and education to media and publishing. We value the collaboration with our client partners, including but not limited to Adidas, Amazon, Con Edison, Diageo, EY, J.P. Morgan Chase, Lenovo, Marriott, Mars, Microsoft, Thomson Reuters, and TikTok.

The Code and Theory network is comprised of nearly 2,000 people with 50% engineers and 50% creative talent. We’re always on the lookout for smart, driven, and forward-thinking people to join our team.

Top Skills

GCP
Python
SQL
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
New York, NY
445 Employees
Year Founded: 2001

What We Do

Code and Theory is a strategically driven, digital-first creative agency that lives at the intersection of creativity and technology. We solve consumer and business problems across the entire customer journey that flex to meet the ever-changing needs of consumer expectations. We put the user, their behaviors and needs, at the center of everything we do — from our proprietary research methodologies, to product development processes, to how we create brand, channel and messaging strategies. Our goal is simple: to solve our clients business problems.

We bring big ideas to life by looking holistically at brand ecosystems where digital plays a prominent role in driving the consumer from first-touch through to conversion to relationship deepening over time. We identify gaps in the consumer journey and opportunities in culture that require products, services or communications to fill. We work across categories, ranging as far and wide as health care (Pfizer, Sanofi, Reach MD, Bioreference Laboratories) to financial services (JP Morgan Chase, Prudential, Morgan Stanley, First Data) to cpg (Mars, Unilever, Johnson & Johnson) to technology companies (Facebook, Xerox, Samsung, Comcast) to culture brands (adidas, H&M). And because our DNA is in publishing — we’ve embedded in over 135 newsrooms in the past decade — we bring unique expertise in understanding how content is created, distributed and optimized, including our work with CNN, NBC News, NBC Sports, and Bustle Digital Group.

At Code and Theory, we strive to only be limited by our own ambition and creativity. We believe in pushing our creativity beyond the easy and obvious answers in order to deliver the solutions that are right for our clients, their businesses, and their consumers.

Gallery

Gallery

Similar Jobs

Boeing Logo Boeing

Software Engineer

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Industrial Area SSI, Rajaji Nagar, Bangalore, Karnataka, IND
141000 Employees
8-8 Annually

Boeing Logo Boeing

Software Engineering Manager

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Industrial Area SSI, Rajaji Nagar, Bangalore, Karnataka, IND
141000 Employees

Boeing Logo Boeing

Lead Software Engineer

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Industrial Area SSI, Rajaji Nagar, Bangalore, Karnataka, IND
141000 Employees

Boeing Logo Boeing

Software Engineer

Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
In-Office
Industrial Area SSI, Rajaji Nagar, Bangalore, Karnataka, IND
141000 Employees

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account