Lead AI Developer and DevOps

Posted 11 Days Ago
8 Locations
In-Office
Expert/Leader
AdTech • Marketing Tech • Software
The Role
A Lead AI Developer and DevOps position focused on AI model development, deployment, and infrastructure management using ML and cloud technologies.
Summary Generated by Built In
The purpose of this role is to lead collaboration with ML Engineers and DevOps Engineers to formulate AI designs that can be built, tested, and deployed through the Route to Live and into Production using continuous integration and continuous deployment (CI/CD).

Job Description:

Model Development & Deployment

Model fine-tuning: Use open-source libraries like DeepSpeed, Hugging Face Transformers, JAX, PyTorch, and TensorFlow to improve model performance

Large Language Model Operations (LLMOps)

Model deployment and maintenance: Deploying and managing LLMs on cloud platforms

Model training and fine-tuning: Training and refining LLMs to improve their performance on specific tasks

Scaling and release management: Scaling LLMs up and down, performing blue/green deployments, and rolling back bad releases

Data Management & Pipeline Operations

Curating and preparing training data, as well as monitoring and maintaining data quality

Data prep and prompt engineering: Iteratively transform, aggregate, and de-duplicate data, and make the data visible and shareable across data teams

Building vector databases to retrieve contextually relevant information
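
For candidates unfamiliar with the retrieval step a vector database performs, the idea can be sketched in plain Python. This is only an illustration under stated assumptions: the `TinyVectorStore` class, its methods, and the toy 3-dimensional embeddings are hypothetical, and a production system would use a real embedding model and a dedicated vector database rather than brute-force search.

```python
import math
from typing import List, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class TinyVectorStore:
    """Hypothetical in-memory store using brute-force search, for illustration only."""

    def __init__(self) -> None:
        self._items: List[Tuple[str, List[float]]] = []

    def add(self, text: str, embedding: List[float]) -> None:
        """Store a text snippet together with its (precomputed) embedding."""
        self._items.append((text, embedding))

    def search(self, query: List[float], k: int = 1) -> List[Tuple[str, float]]:
        """Return the k stored snippets most similar to the query embedding."""
        scored = [(text, cosine_similarity(query, emb)) for text, emb in self._items]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:k]


# Toy embeddings, made up for this example.
store = TinyVectorStore()
store.add("refund policy", [1.0, 0.0, 0.0])
store.add("shipping times", [0.0, 1.0, 0.0])
store.add("account login", [0.0, 0.0, 1.0])

top = store.search([0.9, 0.1, 0.0], k=2)
```

Here `top` holds the two snippets closest to the query vector, best match first; the retrieved text would then be passed to the LLM as context.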

Monitoring & Evaluation

Monitoring and evaluation: tracking LLM performance, identifying errors, and optimizing models

Model monitoring with human feedback: Create model and data monitoring pipelines with alerts both for model drift and for malicious user behavior

Establish monitoring metrics
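
As a rough illustration of what a drift alert on a monitoring metric might look like, the sketch below flags a window of recent scores whose mean has moved far from a baseline. The function names, the sample scores, and the threshold of 3 baseline standard deviations are all assumptions made for this example, not a prescribed method; real pipelines typically use richer statistical tests and an alerting service.

```python
import statistics
from typing import Sequence


def drift_score(baseline: Sequence[float], recent: Sequence[float]) -> float:
    """Shift of the recent mean from the baseline mean, in baseline standard deviations."""
    base_mean = statistics.fmean(baseline)
    base_std = statistics.pstdev(baseline)
    if base_std == 0.0:
        # Degenerate baseline with no spread: any change counts as unbounded drift.
        return 0.0 if statistics.fmean(recent) == base_mean else float("inf")
    return abs(statistics.fmean(recent) - base_mean) / base_std


def should_alert(
    baseline: Sequence[float], recent: Sequence[float], threshold: float = 3.0
) -> bool:
    """True when the recent window has drifted beyond the (assumed) threshold."""
    return drift_score(baseline, recent) > threshold


# Made-up evaluation scores for illustration.
baseline_scores = [0.80, 0.82, 0.79, 0.81, 0.78]
stable_scores = [0.80, 0.81, 0.79]
drifted_scores = [0.60, 0.62, 0.58]

stable_alert = should_alert(baseline_scores, stable_scores)
drifted_alert = should_alert(baseline_scores, drifted_scores)
```

With these sample values the stable window does not trigger an alert, while the drifted window does.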

Infrastructure & DevOps

Continuous integration and delivery (CI/CD): Build CI/CD pipelines that automate the model development process and streamline testing and deployment

Develop and manage infrastructure for distributed model training (e.g., SageMaker, Ray, Kubernetes). Deploy ML models using containerization (Docker)

Required Technical Skills

Programming & Frameworks

Experience with open-source libraries like DeepSpeed, Hugging Face Transformers, JAX, PyTorch, and TensorFlow

Building LLM pipelines with tools like LangChain or LlamaIndex

Python programming expertise for ML model development

Experience with containerization technologies (Docker, Kubernetes)

Cloud Platforms & Infrastructure

Familiarity with cloud platforms like AWS, Azure, or GCP, including knowledge of services like EC2, S3, SageMaker, or Google Cloud ML Engine for scalable and efficient model deployment

Deploying large language models on Azure and AWS clouds or services such as Databricks

Experience with distributed training infrastructure

LLM-Specific Technologies

Vector databases for RAG implementations

Prompt engineering and template management

Prompting techniques such as few-shot and chain-of-thought (CoT) prompting to improve model accuracy and response quality

Fine-tuning and model customization techniques

Knowledge Graphs

Relevance Engineering
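
To make the few-shot and chain-of-thought prompting items in this list concrete, here is a minimal sketch of a prompt template. The `build_few_shot_prompt` helper, the `Q:`/`A:` layout, and the "Let's think step by step." cue are illustrative assumptions, not a required format; in practice a templating tool from an LLM framework would likely fill this role.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    instruction: str,
    examples: List[Tuple[str, str]],
    query: str,
    chain_of_thought: bool = False,
) -> str:
    """Assemble an instruction, worked examples, and the new query into one prompt."""
    lines = [instruction, ""]
    for question, answer in examples:
        # Each worked example shows the model the expected input/output shape.
        lines.append(f"Q: {question}")
        lines.append(f"A: {answer}")
        lines.append("")
    lines.append(f"Q: {query}")
    # For chain-of-thought, nudge the model to reason before answering.
    lines.append("A: Let's think step by step." if chain_of_thought else "A:")
    return "\n".join(lines)


prompt = build_few_shot_prompt(
    "Classify the sentiment of each review as positive or negative.",
    [("Great product, works perfectly.", "positive"),
     ("Broke after one day.", "negative")],
    "Fast delivery and easy setup.",
)
cot_prompt = build_few_shot_prompt(
    "Answer the arithmetic question.",
    [("What is 2 + 3?", "2 + 3 = 5. The answer is 5.")],
    "What is 4 + 9?",
    chain_of_thought=True,
)
```

The resulting strings would be sent to the model as-is; keeping the template in one function makes it easy to version and test prompt changes.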

Location:

DGS India - Pune - Baner M-Agile

Brand:

Merkle

Time Type:

Full time

Contract Type:

Permanent

Top Skills

AWS
Azure
DeepSpeed
Docker
GCP
Hugging Face Transformers
JAX
Kubernetes
Python
PyTorch
TensorFlow

The Company
6,507 Employees

What We Do

Dentsu Creative is a global creative agency network designed to unlock exponential growth for clients. We use Transformative Creativity as a differentiating, driving force to bring our capabilities together to positively impact people, business and society. Established in 2022, Dentsu Creative is integrated with dentsu’s Media and CXM businesses in over 145 countries and regions, to offer Integrated Growth Solutions.
