Forward Deployed ML Engineer, Agents

Reposted 6 Days Ago
Be an Early Applicant
2 Locations
In-Office
Mid level
Artificial Intelligence • Information Technology • Software
The Role
As a Forward Deployed ML Engineer at AION, you'll design and develop multimodal AI systems, optimize models, and engage with clients to deliver intelligent agent solutions across various applications.
Summary Generated by Built In
About AION

AION is building an interoperable AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI lifecycle platform—taking organizations from data to deployed models using its forward-deployed engineering approach.

AI is transforming every business around the world, and the demand for compute is surging like never before. AION thrives to be the gateway for dynamic compute workloads by building integration bridges with diverse data centers around the world and re-inventing the compute stack via its state-of-the-art serverless technology. We stand at the crossroads where enterprises are finding it hard to balance AI adoption with security. At AION, we take enterprise security and compliance very seriously and are re-thinking every piece of infrastructure from hardware and network packets to API interfaces.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team in India/UK.

Who You Are

You're a hands-on AI engineer with 3-5+ years of experience building production-grade multimodal AI systems and LLM applications. Your responsibilities mirror those of a hands-on AI startup CTO—you work in small teams to own delivery of high-stakes customer projects, embedding directly at client sites to architect, build, and deploy intelligent agent solutions.

You're equally comfortable writing production code, presenting technical solutions to C-level executives, and debugging complex AI systems on factory floors or in customer data centers. You've shipped voice agents, video processing systems, or conversational AI to production. You thrive translating ambiguous business requirements into concrete technical solutions that create measurable impact.

You're comfortable working across the full AI deployment lifecycle—from use case discovery and solution architecture to multimodal agent development, MLOps pipeline implementation, and production optimization. You understand what makes agents perform well in production and how to systematically improve quality through observability and evaluation. Experience with voice AI platforms, RAG systems, and LLM orchestration frameworks is highly desirable. You bring exceptional communication skills, customer empathy, and the drive to build AI solutions that transform enterprise operations globally.


RequirementsWhat You'll Do
Customer Engagement & Multimodal Agent Development
  • Work directly at customer sites—from factory floors to executive offices—conducting discovery workshops and technical assessments to identify high-impact AI opportunities
  • Design and architect end-to-end multimodal agent systems (voice + video + text) that leverage AION's distributed GPU infrastructure and managed services
  • Build production-grade voice AI systems using STT, TTS APIs, and LLMs deployed on AION's platform
  • Develop vision-enabled agents processing real-time video streams using computer vision pipelines on AION's infrastructure
  • Implement sophisticated multi-agent orchestration with(or similar) frameworks like LangChain or LlamaIndex—enabling tool use, memory management, and autonomous task completion
  • Rapidly prototype POCs in 2-4 weeks, coding alongside client teams to validate concepts and iterate based on feedback
  • Optimize for sub-500ms latency, natural conversation flow, turn detection, and interruption handling in real-time systems
  • Integrate agents directly into customer codebases via REST/GraphQL/WebSocket APIs and custom SDKs (Python, TypeScript)
  • Act as trusted technical advisor to customers, shaping AI strategy and guiding roadmap decisions from concept to production
Data Strategy & MLOps Infrastructure
  • Design data architectures with efficient processing pipelines and ingestion workflows for training and inference on AION's platform
  • Implement RAG systems with vector databases—optimizing embedding strategies, chunk sizes, and retrieval methods
  • Prepare and validate datasets for fine-tuning, evaluation, and synthetic data generation
  • Work with other MLEs, MLOps, SREs to carry out model deployment and productionization
Observability, Evaluation & Production Operations
  • Implement LLM and agents observability and monitoring—tracking token usage, latency, costs, and quality metrics across deployments on AION's infrastructure
  • Instrument applications to trace LLM calls, retrieval operations, agent actions, and data flows
  • Build evaluation frameworks with offline benchmarks (accuracy, relevance, safety metrics) and online monitoring (user feedback, drift detection)
Technical Skills & Experience

If you are meeting some of these requirements and feel comfortable catching up on others, we definitely recommend you to apply:

  • 3-5+ years of hands-on experience building production AI/ML systems, with 1-2+ years deploying LLM applications to production
  • Multimodal AI expertise—practical experience building voice agents, vision systems, or conversational AI serving real users
  • Strong LLM foundations—hands-on with modern foundation models including fine-tuning, prompt engineering, and evaluation methodologies
  • Agent framework proficiency—production experience with LangChain, LlamaIndex, or similar orchestration frameworks
  • Voice AI platform experience—built real-time conversational systems with production STT/TTS integration
  • Proficiency in Python (production-grade, async programming, type hints) and JavaScript/TypeScript (full-stack development)
  • RAG implementation experience—built retrieval-augmented generation systems with vector databases
  • MLOps & deployment—hands-on with Docker, Kubernetes, CI/CD pipelines, and infrastructure-as-code
  • Cloud platforms—experience with AWS, Azure, or GCP for ML workloads and infrastructure management
  • Exceptional communication—ability to explain complex AI concepts clearly to both technical and business stakeholders
  • Customer-facing experience in Solutions Architecture, Technical Account Management, or Pre-Sales Engineering is highly desirable
  • Computer vision experience—working with video processing, object detection, or vision-language models is a plus
  • Model fine-tuning—practical experience with LoRA/QLoRA, supervised fine-tuning, or RLHF workflows is a plus
  • Inference optimization—experience with vLLM, TensorRT-LLM, Triton, or model quantization techniques is desirable
  • Observability tooling—practical experience with LLM monitoring, tracing, and evaluation frameworks is a strong plus
  • Familiarity with WebRTC, real-time streaming protocols, and low-latency media processing

Benefits

Why Join AION?

  • Work directly with high-pedigree founders shaping technical and product strategy.
  • Build infrastructure powering the future of AI compute globally.
  • Significant ownership and impact with equity reflective of your contributions.
  • Competitive compensation, flexible work options, and wellness benefits.

Apply Now:
If you’re a machine learning engineer ready to lead MLAAS(Machine learning as a Service) architecture and scale next-generation AI infrastructure, we want to hear from you. Please share the following in the summary section:

  • Your resume highlights relevant projects and leadership experience
  • Links to products, code(Github), or demos you’ve built.
  • A brief note on why AION’s mission excites you.

Top Skills

AWS
Azure
Docker
GCP
GraphQL
JavaScript
Kubernetes
Langchain
Llamaindex
Llms
Python
Rest
Stt
Tts
Typescript
Vector Databases
Websocket
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
21 Employees
Year Founded: 2023

What We Do

Everyday AI Platform: aion collapses the entire ai development lifecycle into a single, unified workspace. From data to deployment - everything at your fingertips. aion simplifies AI infrastructure the way Stripe simplified payments:

Plug-and-Play Multi-Provider Access
Customer Infrastructure Management
Deploy and optimize AI infrastructure via prompts with integrated cost tracking and performance analytics
Partner Sales & Resource Optimization

Track opportunities with confidential pricing, manage real-time inventory allocation, and monitor profitability from aion workloads

Similar Jobs

TransUnion Logo TransUnion

Senior Fraud Specialist

Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Remote or Hybrid
2 Locations
13000 Employees

Snap Inc. Logo Snap Inc.

Systems Architect

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
Plymouth, Devon, England, GBR
5000 Employees

Snap Inc. Logo Snap Inc.

Engineering Manager

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
London, Greater London, England, GBR
5000 Employees

Remitly Logo Remitly

Product Designer

eCommerce • Fintech • Payments • Software • Financial Services
In-Office
London, Greater London, England, GBR
2800 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account