EXL Jobs

Senior AI Data Engineer

EXL

Senior AI Data Engineer

Posted 8 Days Ago

Be an Early Applicant

Pune, Mahārāshtra, IND

Hybrid

Senior level

Information Technology • Database • Consulting

The Role

Design and build production-grade LLM/GenAI applications using agentic patterns and RAG pipelines. Implement prompt engineering, orchestration, and guardrails. Integrate solutions with enterprise data platforms, develop APIs, collaborate with Data Engineering and MLOps, and contribute reusable components and documentation.

Summary Generated by Built In

Key Responsibilities

Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
Integrate LLM solutions with enterprise systems, data platforms, and workflows
Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
Contribute to reusable components, documentation, and engineering best practices

Experience & Core Requirements (Must-Have)

Overall Experience

6–9 years total experience
1–3+ years in hands-on GenAI / LLM application development (production use cases)

LLM / GenAI & Agentic Engineering

Strong hands-on experience with:
- LLMs (Claude, OpenAI, etc.)
- RAG pipelines and retrieval optimisation
- GPT + Agentic AI implementation experience
Experience with:
- LangChain, LangGraph, or similar frameworks
- Agent orchestration and tool-calling architectures
Deep understanding of:
- LLM limitations, evaluation, and optimisation strategies

Core Engineering

Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
Deep data analysis experience and handling large volume of data
Fabric/Azure Databricks/Snowflake data engineering integration skills
Good exposure to:
- Cloud platforms (Azure/AWS/GCP)
- SQL
- Containers, CI/CD, monitoring

Data / AI Foundations (Mandatory)

Prior experience in one or more:

Data Engineering (ETL/ELT, pipelines, orchestration)
Data Science / ML lifecycle (especially NLP)
Analytics engineering / data products

Good-to-Have / Preferred

Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
Exposure to agentic coding tools (e.g., Claude Code or similar environments)

Responsibilities

Key Responsibilities

Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
Integrate LLM solutions with enterprise systems, data platforms, and workflows
Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
Contribute to reusable components, documentation, and engineering best practices

Experience & Core Requirements (Must-Have)

Overall Experience

6–9 years total experience
1–3+ years in hands-on GenAI / LLM application development (production use cases)

LLM / GenAI & Agentic Engineering

Strong hands-on experience with:
- LLMs (Claude, OpenAI, etc.)
- RAG pipelines and retrieval optimisation
- GPT + Agentic AI implementation experience
Experience with:
- LangChain, LangGraph, or similar frameworks
- Agent orchestration and tool-calling architectures
Deep understanding of:
- LLM limitations, evaluation, and optimisation strategies

Core Engineering

Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
Deep data analysis experience and handling large volume of data
Fabric/Azure Databricks/Snowflake data engineering integration skills
Good exposure to:
- Cloud platforms (Azure/AWS/GCP)
- SQL
- Containers, CI/CD, monitoring

Data / AI Foundations (Mandatory)

Prior experience in one or more:

Data Engineering (ETL/ELT, pipelines, orchestration)
Data Science / ML lifecycle (especially NLP)
Analytics engineering / data products

Good-to-Have / Preferred

Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
Exposure to agentic coding tools (e.g., Claude Code or similar environments)

Qualifications

Key Responsibilities

Design and develop LLM-powered applications using agentic patterns (single/multi-agent) for business use cases
Build and optimise end-to-end RAG pipelines (ingestion, embeddings, retrieval, orchestration, response synthesis)
Implement prompt engineering and orchestration techniques (prompt chaining, tool/function calling, structured outputs)
Develop production-grade APIs and services (FastAPI/Flask/Streamlit) for GenAI applications
Integrate LLM solutions with enterprise systems, data platforms, and workflows
Apply guardrails and evaluation frameworks to improve response quality, reduce hallucinations, and ensure responsible AI usage
Collaborate with Data Engineering and MLOps teams for data pipelines, deployment, monitoring, and scaling
Contribute to reusable components, documentation, and engineering best practices

Experience & Core Requirements (Must-Have)

Overall Experience

6–9 years total experience
1–3+ years in hands-on GenAI / LLM application development (production use cases)

LLM / GenAI & Agentic Engineering

Strong hands-on experience with:
- LLMs (Claude, OpenAI, etc.)
- RAG pipelines and retrieval optimisation
- GPT + Agentic AI implementation experience
Experience with:
- LangChain, LangGraph, or similar frameworks
- Agent orchestration and tool-calling architectures
Deep understanding of:
- LLM limitations, evaluation, and optimisation strategies

Core Engineering

Strong Python/Pyspark engineering expertise (production-grade development) with proven API integration experience
Deep data analysis experience and handling large volume of data
Fabric/Azure Databricks/Snowflake data engineering integration skills
Good exposure to:
- Cloud platforms (Azure/AWS/GCP)
- SQL
- Containers, CI/CD, monitoring

Data / AI Foundations (Mandatory)

Prior experience in one or more:

Data Engineering (ETL/ELT, pipelines, orchestration)
Data Science / ML lifecycle (especially NLP)
Analytics engineering / data products

Good-to-Have / Preferred

Experience with fine-tuning techniques (LoRA, PEFT) or prompt tuning strategies
Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric, etc.)
Exposure to agentic coding tools (e.g., Claude Code or similar environments)

Skills Required

6-9 years total professional experience
1-3+ years hands-on GenAI / LLM application development (production use cases)
Hands-on experience with LLMs (Claude, OpenAI, GPT)
Designing and implementing RAG pipelines, embeddings, retrieval optimization
GPT and agentic AI implementation experience, including agent orchestration and tool/function calling
Experience with LangChain, LangGraph, or similar frameworks
Strong Python and PySpark engineering expertise with production-grade development
Proven experience building production APIs/services (FastAPI, Flask, Streamlit) and API integration
Deep data analysis experience and handling large volumes of data
Experience integrating with Fabric / Azure Databricks / Snowflake data platforms
Exposure to cloud platforms (Azure, AWS, GCP), SQL, containers, CI/CD, and monitoring
Prior experience in one or more: Data Engineering (ETL/ELT, pipelines), Data Science/ML lifecycle (especially NLP), or Analytics engineering/data products
Deep understanding of LLM limitations, evaluation, and optimization strategies; prompt engineering and orchestration techniques
Experience with fine-tuning techniques (LoRA, PEFT)
Experience with enterprise GenAI security & privacy practices (data masking, access control, compliance)
Familiarity with Azure AI ecosystem (Azure OpenAI, Azure AI Search, Fabric)
Exposure to agentic coding tools (e.g., Claude Code)

View all jobs at EXL

View EXL Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: New York, NY

30,246 Employees

Year Founded: 1999

What We Do

Choosing a digital partner is about more than capabilities — it’s about collaboration and character. Unrealistic overhauls and off-the-shelf products ignore what matters most — your unique needs, culture, goals, and your legacy data and technology environments. At EXL, our collaboration is built on ongoing listening and learning to adapt our methodologies. We’re your business evolution partner—tailoring solutions that make the most of data to make better business decisions and drive more intelligence into your increasingly digital operations. Whether your goals are scaling the use of AI and digital, redesign operating models, or driving better and faster decisions, we’re here to partner with you to help you gain—and maintain—competitive advantage with efficient, sustainable models at scale. Our expertise in transformation, data science, and change management helps make your business more efficient and effective, improve customer relationships and enhance revenue growth. Instead of focusing on multi-year, resource- and time-intensive platform designs or migrations, we look deeper at your entire value chain to integrate strategies with impact. We use our specialization in analytics, digital interventions, and operations management—alongside deep industry expertise — to deliver solutions that help you outperform the competition. At EXL, it’s all about outcomes—your outcomes—and delivering success on your terms. Share your goals with us and together, we’ll optimize how you leverage data to drive your business forward. For more information, visit www.exlservice.com.