ML Inference Platform Intern (6 months)

Posted 24 Days Ago
Seattle, WA
In-Office
Internship
Artificial Intelligence • Information Technology • Software
The Role
As an ML Inference Platform Intern, you will learn and implement ML inference optimization techniques, contribute to GPU optimization projects, and build benchmarking frameworks while working under mentorship.
About aion

aion is building the next generation of AI cloud platforms, transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, aion democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and the full-stack AI/ML lifecycle.

Led by high-pedigree founders with previous exits, aion is well-funded by major VCs and has strategic global partnerships. Headquartered in the US with a global presence, the company is building its initial core team across India, London, and SF.

Who You Are

You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.


Requirements
Key Responsibilities
  • Learn and implement ML inference optimization techniques including KV-cache management, dynamic batching, and quantization under mentorship.
  • Contribute to GPU optimization projects using CUDA with hands-on learning of Triton kernel development and performance tuning.
  • Build model benchmarking and evaluation frameworks to assess performance across different models and optimization strategies.
  • Research and experiment with trending open-source models (DeepSeek R1, Qwen 3, Llama variants) to understand optimization opportunities.
  • Implement cost-performance analysis tools to understand tradeoffs between speed, quality, and resource usage.
  • Explore agent system implementations and multi-step reasoning workflows for future platform capabilities.
  • Document learning and create technical guides for internal team knowledge sharing and customer education.
Skills & Experience
  • High-agency individual with a strong willingness to experiment and learn with the team.
  • Previous internships or projects in ML infrastructure, contributions using PyTorch/ML frameworks, competitive programming achievements, research experience in ML systems, familiarity with agent systems or reasoning techniques.
  • Strong coding and implementation skills in Python and C++ with demonstrated ability to write performant, production-quality code.
  • Experience reading and contributing to large codebases with proof of open-source contributions (GitHub profile required).
  • Proof of technical work through projects like Google Summer of Code, hackathon wins, competitive programming, or significant open-source contributions.
  • Working knowledge of deep learning fundamentals including neural networks, transformers, and basic training/inference concepts.
  • Basic understanding of PyTorch including model development and tensor operations.
  • Fundamental knowledge of GPU computing or strong willingness to learn CUDA programming.
  • Working knowledge of at least one inference framework (vLLM, TensorRT-LLM, Hugging Face) through coursework or personal projects.
  • Understanding of distributed systems concepts and performance optimization principles.

Benefits
  • Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
  • Learn from world-class engineers and gain hands-on experience with cutting-edge inference optimization techniques.
  • Work with a high-caliber, globally distributed team backed by major VCs.
  • Significant learning and growth opportunity in one of the fastest-moving areas of AI infrastructure.
  • Competitive internship compensation with potential for full-time conversion.
  • Fast-paced, flexible work environment with room for ownership and impact.
    If you have any questions about the role, please reach out to the hiring manager on LinkedIn or X.

Top Skills

C++
CUDA
Hugging Face
Open-Source Tools
Python
PyTorch
TensorRT-LLM
Triton
vLLM

The Company
HQ: San Francisco, California
21 Employees
Year Founded: 2023

What We Do

Everyday AI Platform: aion collapses the entire AI development lifecycle into a single, unified workspace. From data to deployment, everything is at your fingertips. aion simplifies AI infrastructure the way Stripe simplified payments:

  • Plug-and-Play Multi-Provider Access
  • Customer Infrastructure Management: Deploy and optimize AI infrastructure via prompts with integrated cost tracking and performance analytics.
  • Partner Sales & Resource Optimization: Track opportunities with confidential pricing, manage real-time inventory allocation, and monitor profitability from aion workloads.
