Senior AI Data Engineer

Posted 8 Days Ago
Be an Early Applicant
Pune, Mahārāshtra, IND
Hybrid
Senior level
Information Technology • Database • Consulting
The Role
Design and build production-grade LLM/GenAI applications using agentic patterns and RAG pipelines. Implement prompt engineering, orchestration, and guardrails. Integrate solutions with enterprise data platforms, develop APIs, collaborate with Data Engineering and MLOps, and contribute reusable components and documentation.
Summary Generated by Built In

Key Responsibilities

  • Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
  • Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
  • Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
  • Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
  • Integrate LLM solutions with enterprise systems, data platforms, and workflows
  • Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
  • Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
  • Contribute to reusable components, documentation, and engineering best practices
 

Experience & Core Requirements (Must-Have)

Overall Experience

  • 6–9 years total experience
  • 1–3+ years in hands-on GenAI / LLM application development (production use cases)
 

LLM / GenAI & Agentic Engineering

  • Strong hands-on experience with:
    • LLMs (Claude, OpenAI, etc.)
    • RAG pipelines and retrieval optimisation
    • GPT + Agentic AI implementation experience
  • Experience with:
    • LangChain, LangGraph, or similar frameworks
    • Agent orchestration and tool-calling architectures
  • Deep understanding of:
    • LLM limitations, evaluation, and optimisation strategies
 

Core Engineering

  • Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
  • Deep data analysis experience and handling large volume of data
  • Fabric/Azure Databricks/Snowflake data engineering integration skills
  • Good exposure to:
    • Cloud platforms (Azure/AWS/GCP)
    • SQL
    • Containers, CI/CD, monitoring
 

Data / AI Foundations (Mandatory)

Prior experience in one or more:

  • Data Engineering (ETL/ELT, pipelines, orchestration)
  • Data Science / ML lifecycle (especially NLP)
  • Analytics engineering / data products
 

Good-to-Have / Preferred

  • Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
  • Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
  • Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
  • Exposure to agentic coding tools (e.g., Claude Code or similar environments)
Responsibilities

Key Responsibilities

  • Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
  • Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
  • Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
  • Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
  • Integrate LLM solutions with enterprise systems, data platforms, and workflows
  • Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
  • Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
  • Contribute to reusable components, documentation, and engineering best practices
 

Experience & Core Requirements (Must-Have)

Overall Experience

  • 6–9 years total experience
  • 1–3+ years in hands-on GenAI / LLM application development (production use cases)
 

LLM / GenAI & Agentic Engineering

  • Strong hands-on experience with:
    • LLMs (Claude, OpenAI, etc.)
    • RAG pipelines and retrieval optimisation
    • GPT + Agentic AI implementation experience
  • Experience with:
    • LangChain, LangGraph, or similar frameworks
    • Agent orchestration and tool-calling architectures
  • Deep understanding of:
    • LLM limitations, evaluation, and optimisation strategies
 

Core Engineering

  • Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
  • Deep data analysis experience and handling large volume of data
  • Fabric/Azure Databricks/Snowflake data engineering integration skills
  • Good exposure to:
    • Cloud platforms (Azure/AWS/GCP)
    • SQL
    • Containers, CI/CD, monitoring
 

Data / AI Foundations (Mandatory)

Prior experience in one or more:

  • Data Engineering (ETL/ELT, pipelines, orchestration)
  • Data Science / ML lifecycle (especially NLP)
  • Analytics engineering / data products
 

Good-to-Have / Preferred

  • Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
  • Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
  • Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
  • Exposure to agentic coding tools (e.g., Claude Code or similar environments)
Qualifications

Key Responsibilities

  • Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
  • Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
  • Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
  • Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
  • Integrate LLM solutions with enterprise systems, data platforms, and workflows
  • Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
  • Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
  • Contribute to reusable components, documentation, and engineering best practices
 

Experience & Core Requirements (Must-Have)

Overall Experience

  • 6–9 years total experience
  • 1–3+ years in hands-on GenAI / LLM application development (production use cases)
 

LLM / GenAI & Agentic Engineering

  • Strong hands-on experience with:
    • LLMs (Claude, OpenAI, etc.)
    • RAG pipelines and retrieval optimisation
    • GPT + Agentic AI implementation experience
  • Experience with:
    • LangChain, LangGraph, or similar frameworks
    • Agent orchestration and tool-calling architectures
  • Deep understanding of:
    • LLM limitations, evaluation, and optimisation strategies
 

Core Engineering

  • Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
  • Deep data analysis experience and handling large volume of data
  • Fabric/Azure Databricks/Snowflake data engineering integration skills
  • Good exposure to:
    • Cloud platforms (Azure/AWS/GCP)
    • SQL
    • Containers, CI/CD, monitoring
 

Data / AI Foundations (Mandatory)

Prior experience in one or more:

  • Data Engineering (ETL/ELT, pipelines, orchestration)
  • Data Science / ML lifecycle (especially NLP)
  • Analytics engineering / data products
 

Good-to-Have / Preferred

  • Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
  • Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
  • Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
  • Exposure to agentic coding tools (e.g., Claude Code or similar environments)

Skills Required

  • 6-9 years total professional experience
  • 1-3+ years hands-on GenAI / LLM application development (production use cases)
  • Hands-on experience with LLMs (Claude, OpenAI, GPT)
  • Designing and implementing RAG pipelines, embeddings, retrieval optimization
  • GPT and agentic AI implementation experience, including agent orchestration and tool/function calling
  • Experience with LangChain, LangGraph, or similar frameworks
  • Strong Python and PySpark engineering expertise with production-grade development
  • Proven experience building production APIs/services (FastAPI, Flask, Streamlit) and API integration
  • Deep data analysis experience and handling large volumes of data
  • Experience integrating with Fabric / Azure Databricks / Snowflake data platforms
  • Exposure to cloud platforms (Azure, AWS, GCP), SQL, containers, CI/CD, and monitoring
  • Prior experience in one or more: Data Engineering (ETL/ELT, pipelines), Data Science/ML lifecycle (especially NLP), or Analytics engineering/data products
  • Deep understanding of LLM limitations, evaluation, and optimization strategies; prompt engineering and orchestration techniques
  • Experience with fine-tuning techniques (LoRA, PEFT)
  • Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
  • Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric)
  • Exposure to agentic coding tools (e.g., Claude Code)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: New York, NY
30,246 Employees
Year Founded: 1999

What We Do

Choosing a digital partner is about more than capabilities — it’s about collaboration and character. Unrealistic overhauls and off-the-shelf products ignore what matters most — your unique needs, culture, goals, and your legacy data and technology environments. At EXL, our collaboration is built on ongoing listening and learning to adapt our methodologies. We’re your business evolution partner—tailoring solutions that make the most of data to make better business decisions and drive more intelligence into your increasingly digital operations. Whether your goals are scaling the use of AI and digital, redesign operating models, or driving better and faster decisions, we’re here to partner with you to help you gain—and maintain—competitive advantage with efficient, sustainable models at scale. Our expertise in transformation, data science, and change management helps make your business more efficient and effective, improve customer relationships and enhance revenue growth. Instead of focusing on multi-year, resource- and time-intensive platform designs or migrations, we look deeper at your entire value chain to integrate strategies with impact. We use our specialization in analytics, digital interventions, and operations management—alongside deep industry expertise — to deliver solutions that help you outperform the competition. At EXL, it’s all about outcomes—your outcomes—and delivering success on your terms. Share your goals with us and together, we’ll optimize how you leverage data to drive your business forward. For more information, visit www.exlservice.com.

Similar Jobs

Avaloq Logo Avaloq

Platform Engineer

Information Technology • Consulting
In-Office
Pune, Mahārāshtra, IND
2397 Employees
In-Office or Remote
2 Locations
1151 Employees

Mastercard Logo Mastercard

Software Engineer

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
Pune, Mahārāshtra, IND
38800 Employees

Mastercard Logo Mastercard

Technical Program Manager

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
Pune, Mahārāshtra, IND
38800 Employees

Similar Companies Hiring

Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account