Lead Architect

Reposted 4 Days Ago
5 Locations
In-Office
10-14 Annually
Expert/Leader
Artificial Intelligence • Consulting
The Role
Lead the LLMOps team in designing, deploying, and scaling GenAI applications, focusing on automation, observability, and technical leadership in AI engineering.
Summary Generated by Built In

It's fun to work in a company where people truly BELIEVE in what they are doing!

We're committed to bringing passion and customer focus to the business.

Role overview:

We’re building a next-gen LLMOps team at Fractal to industrialize GenAI implementation and shape the future of GenAI engineering. This is a hands-on technical leadership role for AI engineers with strong ML and DevOps skills — ideal for those who love building scalable systems from the ground up. You will be designing, deploying, and scaling GenAI and Agentic AI applications with robust lifecycle automation and observability.

Required Qualifications:

  • 10-14 years of experience working on ML projects, including a product-building mindset, strong hands-on skills, technical leadership, and experience leading development teams
  • Experience with model development, training, deployment at scale, and performance monitoring for production use cases
  • Strong knowledge of Python, data engineering, FastAPI, and NLP
  • Knowledge of LangChain, LlamaIndex, Langtrace, Langfuse, LLM evaluation, MLflow, and BentoML
  • Should have worked with proprietary and open-source LLMs
  • Experience with LLM fine-tuning, including PEFT/CPT
  • Experience in creating Agentic AI workflows using frameworks like CrewAI, LangGraph, AutoGen, and Semantic Kernel
  • Experience in performance optimization, RAG, guardrails, AI governance, prompt engineering, evaluation, and observability
  • Experience in GenAI application deployment on cloud and on-premises at scale for production using DevOps practices
  • Experience in DevOps and MLOps
  • Good working knowledge of Kubernetes and Terraform
  • Experience deploying AI services on at least one cloud platform: AWS, GCP, or Azure
  • Team player with excellent communication and presentation skills

Must have skills:

  • Product thinking that includes ideating, prototyping, and scaling internal accelerators for LLMOps
  • Architect and build scalable LLMOps platforms for enterprise-grade GenAI systems
  • Design and manage end-to-end LLM pipelines from data ingestion and embedding to evaluation and inference
  • Drive LLM-specific infrastructure: memory management, token control, prompt chaining, and context optimization
  • Lead scalable deployment frameworks for LLMs using Kubernetes and GPU-aware scaling
  • Build agentic AI operations capabilities including agent evaluation, observability, orchestration and reflection loops
  • Guardrails & Observability: Implement output filtering, context-aware routing, evaluation harnesses, metrics logging, and incident response
  • Platform Automation for LLMOps: Drive end-to-end automation with Docker, Kubernetes, GitOps, DevOps, Terraform, etc.

Product Thinking: Ideate, prototype, and scale internal accelerators and reusable components for LLMOps

GenAI Engineering: Productionize LLM-powered applications with modular, reusable, and secure patterns

Pipeline Architecture: Create evaluation pipelines — including prompt orchestration, feedback loops, and fine-tuning workflows

Prompt & Model Management: Design systems for versioning, AI governance, automated testing, and prompt quality scoring

Scalable Deployment: Architect cloud-native and hybrid deployment strategies for large-scale inference

Must-Have Technical Skills

  • LLMOps frameworks: LangChain, MLflow, BentoML, Ray, Truss, FastAPI
  • Prompt evaluation and scoring systems: OpenAI evals, Ragas, Rebuff, Outlines
  • Cloud-native deployment: Kubernetes, Helm, Terraform, Docker, GitOps
  • ML pipeline: Airflow, Prefect, Feast, Feature Store
  • Data stack: Spark/Flink, Parquet/Delta, Lakehouse patterns
  • Cloud: Azure ML, GCP Vertex AI, AWS Bedrock/SageMaker
  • Languages: Python (must), Bash, YAML, Terraform HCL (preferred)

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Top Skills

AWS
Azure
BentoML
FastAPI
GCP
Kubernetes
LangChain
MLflow
Python
Terraform

The Company
Bellevue, WA
5,262 Employees

What We Do

Fractal is one of the most prominent players in the Artificial Intelligence space. Fractal's mission is to power every human decision in the enterprise, and it brings together AI, engineering, and design to help the world's most admired Fortune 500® companies.

Fractal's products include Qure.ai to assist radiologists in making better diagnostic decisions, Crux Intelligence to help CEOs and senior executives make better tactical and strategic decisions, Theremin.ai to improve investment decisions, Eugenie.ai to find anomalies in high-velocity data, and Samya.ai to drive next-generation Enterprise Revenue Growth Management.

Fractal has more than 3,000 employees across 16 global locations, including the United States, UK, Ukraine, India, Singapore, and Australia. Fractal has consistently been rated as one of India's best companies to work for by The Great Place to Work® Institute. It has been featured as a leader in the Customer Analytics Service Providers Wave™ 2021, Computer Vision Consultancies Wave™ 2020, and Specialized Insights Service Providers Wave™ 2020 by Forrester Research, and recognized as an "Honorable Vendor" in the 2021 Magic Quadrant™ for data & analytics by Gartner.
