Senior AI Engineer

Posted 4 Days Ago
Hiring Remotely in United States
Remote
Senior level
Agency • Artificial Intelligence • HR Tech • Professional Services
The Role
Lead design, build, deployment, and maintenance of production-grade GenAI systems (LLM apps, agentic workflows, RAG, retrieval, and evaluation). Architect scalable ML services, implement model serving and observability, optimize latency and cost, fine-tune LLMs, translate prototypes to production, mentor engineers, and produce runbooks and documentation.
Summary Generated by Built In

Rockstar is recruiting for a data intelligence platform that designs, builds, and deploys production-grade AI systems. They are a team of dynamic and savvy professionals who know how to create killer AI applications. Our lean structure and remote team mean we can move fast while still delivering top-notch technology and design.

Position Summary

The client is seeking a Sr. AI Engineer/Sr. Machine Learning Engineer to design, build, deploy, and maintain production-grade AI systems across their data intelligence platform. This role will lead the development of LLM-powered applications, agentic workflows, retrieval-augmented generation systems, model evaluation pipelines, and scalable AI services.

The ideal candidate combines strong machine learning expertise with practical production engineering experience. This person will own complex technical work from concept through deployment, mentor other engineers, and help define best practices for building reliable, observable, and secure AI systems.

Essential Responsibilities
  • Design, build, and deploy production GenAI systems, including LLM applications, agentic workflows, RAG pipelines, and AI-powered search capabilities.
  • Architect scalable AI services using modern ML frameworks, model-serving tools, APIs, Docker, Kubernetes, and CI/CD pipelines.
  • Develop and optimize retrieval systems using embeddings, vector databases, semantic search, reranking, and structured data sources.
  • Fine-tune, adapt, and evaluate LLMs for domain-specific use cases using prompt engineering, supervised fine-tuning, LoRA / QLoRA, or related methods.
  • Build automated evaluation frameworks to measure model quality, prompt performance, retrieval accuracy, reasoning reliability, latency, and cost.
  • Implement observability for AI systems, including tracing, logging, performance monitoring, drift detection, and output-quality review.
  • Translate prototypes and research concepts into reliable product features that can scale in production.
  • Partner with product managers, data engineers, backend engineers, analysts, and business stakeholders to define AI capabilities and technical tradeoffs.
  • Review architecture, provide technical guidance, mentor junior team members, and promote strong engineering practices.
  • Create clear technical documentation, implementation plans, runbooks, and model lifecycle documentation.
Required Qualifications
  • 5+ years of experience in machine learning engineering, AI engineering, data science engineering, or a related technical role.
  • 2+ years of experience building or shipping production GenAI, LLM, or AI-powered systems.
  • Advanced Python programming skills and experience building maintainable production software.
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, or similar ML frameworks.
  • Experience with LLM applications, RAG systems, embeddings, vector databases, prompt engineering, and model evaluation.
  • Experience deploying AI / ML services using Docker, Kubernetes, CI/CD workflows, APIs, and cloud-native infrastructure.
  • Strong understanding of classical machine learning, deep learning, NLP, information retrieval, and model validation.
  • Ability to communicate complex AI concepts clearly to technical and non-technical stakeholders.
  • Experience mentoring engineers, reviewing technical designs, or leading complex AI engineering initiatives.
Preferred Qualifications
  • Advanced degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, or a related field.
  • Experience with agent frameworks such as LangGraph, AutoGen, CrewAI, or similar tools.
  • Experience with model-serving platforms such as vLLM, BentoML, Triton, Ray Serve, or similar systems.
  • Familiarity with ML observability, experiment tracking, model monitoring, and prompt/version management tools.
  • Experience with graph-based retrieval, knowledge graphs, multimodal models, large-scale data processing, or security-focused data products.
  • Experience with infrastructure-as-code, workflow orchestration, model routing, caching, batching, or quantization.
Special Skills or Experience Required
  • Proven experience building and deploying production GenAI systems, including LLM applications, agentic workflows, and RAG pipelines.
  • Advanced Python and ML framework experience, including PyTorch, TensorFlow, Hugging Face Transformers, or similar tools.
  • Experience with LLM fine-tuning, prompt engineering, embeddings, vector databases, semantic search, and model evaluation.
  • Strong production engineering skills, including Docker, Kubernetes, CI/CD, model serving, observability, latency optimization, and technical leadership.
Success Measures

Success in this role will be measured by the delivery of reliable AI capabilities, improved model quality, reduced latency and cost, stronger evaluation coverage, improved observability, and the successful mentorship of other engineers. The role should help increase the speed and confidence with which the company can move AI features from prototype to production.

Skills Required

  • 5+ years experience in machine learning, AI engineering, or related technical role
  • 2+ years building or shipping production GenAI, LLM, or AI-powered systems
  • Advanced Python programming skills and building maintainable production software
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers, or similar ML frameworks
  • Experience with LLM applications, RAG systems, embeddings, and vector databases
  • Experience deploying AI/ML services using Docker, Kubernetes, CI/CD workflows, and APIs
  • Strong understanding of classical ML, deep learning, NLP, information retrieval, and model validation
  • Experience fine-tuning/adapting LLMs using prompt engineering, supervised fine-tuning, LoRA/QLoRA, or related methods
  • Experience building automated evaluation frameworks for model quality, retrieval accuracy, and latency/cost metrics
  • Implement observability for AI systems: tracing, logging, monitoring, drift detection, and output-quality review
  • Ability to communicate complex AI concepts to technical and non-technical stakeholders
  • Experience mentoring engineers, reviewing technical designs, or leading complex AI engineering initiatives
  • Advanced degree in CS, ML, AI, Data Science, or related field
  • Experience with agent frameworks (LangGraph, AutoGen, CrewAI) or similar
  • Experience with model-serving platforms (vLLM, BentoML, Triton, Ray Serve) or similar
  • Familiarity with ML observability, experiment tracking, model monitoring, and prompt/version management tools
  • Experience with graph-based retrieval, knowledge graphs, multimodal models, or large-scale data processing
  • Experience with infrastructure-as-code, workflow orchestration, model routing, caching, batching, or quantization
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
6,000 Employees
Year Founded: 1998

What We Do

Rockstar is a full-service recruitment company that leverages a blend of human expertise and artificial intelligence to help businesses hire better and faster at a lower cost. They offer comprehensive recruitment services across a wide range of professional roles, utilizing proprietary AI to efficiently match candidates to job descriptions and conducting custom screening calls to ensure high-quality hires.

Similar Jobs

Optum Logo Optum

Software Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
120K-215K Annually

Optum Logo Optum

Machine Learning Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Remote or Hybrid
Basking Ridge, NJ, USA
160000 Employees
120K-215K Annually

Atlassian Logo Atlassian

Senior Systems Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
Salt Lake City, UT, USA
11000 Employees
147K-230K Annually

Optum Logo Optum

Machine Learning Engineer

Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
In-Office or Remote
Eden Prairie, MN, USA
160000 Employees
120K-215K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account