AI/ML Engineer

Posted 2 Days Ago
San Jose, CA, USA
In-Office
140K-165K Annually
Mid level
Big Data • Information Technology
The Role
Build and deploy production AI/LLM systems and agentic workflows for engineering productivity, diagnostics, search, and automation. Design retrieval, tool integration, evals, failure handling, and reusable skills; collaborate with infrastructure and domain teams to monitor and improve systems in production.
Summary Generated by Built In

Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs’ Intelligent Connectivity Platform integrates CXL®, Ethernet, NVLink, PCIe®, and UALink™ semiconductor-based technologies with the company’s COSMOS software suite to unify diverse components into cohesive, flexible systems that deliver end-to-end scale-up and scale-out connectivity. The company’s custom connectivity solutions business complements its standards-based portfolio, enabling customers to deploy tailored architectures to meet their unique infrastructure requirements. Discover more at www.asteralabs.com.

AI/ML Engineer

Location: San Jose, CA
Experience: 1–5 years
Team: Applied AI


The role

We’re hiring an AI/ML Engineer to build production AI systems for technical users. This is an applied engineering role for someone who can take modern model capabilities and turn them into reliable systems that people actually use.

The core problems in this role are the same ones that matter in modern applied AI: getting the right context into the system, making tool use reliable, designing useful abstractions around skills and workflows, building evals that reflect real tasks, and iterating until the system is good enough to become part of a team’s daily workflow.

In practice, you might work on coding agents in terminal and IDE environments, verification and debug assistants, log-analysis systems tied to real product diagnostics, documentation and spec-comparison agents, or internal assistants that operate over company knowledge and engineering data. You will be expected to think end-to-end: prompt and context design, retrieval quality, tool interfaces, evals, failure modes, deployment, and ongoing improvement.


What you’ll do
  • Build AI applications and agentic workflows for engineering productivity, diagnostics, search, documentation, and workflow automation.
  • Design systems that combine LLMs with retrieval, tool use, structured outputs, and evaluation loops.
  • Integrate models with internal tools, APIs, CLIs, MCP interfaces, and operational workflows so they can do useful work in real environments.
  • Improve system quality through eval design, prompt and context iteration, model selection, failure analysis, and human feedback.
  • Build reusable skills, workflows, and abstractions so useful capabilities can be shared across agents and teams instead of rebuilt from scratch.
  • Work closely with infrastructure and domain teams to deploy, monitor, and continuously improve AI systems in production.

What we’re looking for
  • 1–5 years of experience in software engineering, applied AI, ML engineering, or related backend/platform roles.
  • Strong Python skills and strong production engineering fundamentals.
  • Hands-on experience building AI/LLM applications, agents, retrieval-backed systems, or workflow automation.
  • Comfort working with tool-using systems where correctness depends on context quality, tool integration, and careful failure handling.
  • Experience with AWS or GCP and the realities of deploying and debugging production AI services.
  • Good judgment around evals, failure modes, latency/cost tradeoffs, and safe rollout of non-deterministic systems.
  • Clear communication and the ability to turn ambiguous technical workflows into robust product behavior.

What strong candidates often look like

They have built more than demos. They have worked on systems where retrieval quality matters, where tool use can fail in subtle ways, where evaluation changes engineering decisions, and where product usefulness depends as much on system design as on model choice. They usually care about the details that separate a clever prototype from a dependable system.


Why this role is interesting

The team’s direction is very concrete: enterprise search, coding agents, workspace automation, customized skills, and agentic applications for specific engineering problems, all measured against real usage and outcomes. This role sits directly in that path. If you want to build applied AI systems that are ambitious but grounded in real workflows, technical users, and fast feedback loops, this is that job.

 

The base pay range for this position is $140,000 to $165,000 annually.

We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.

Skills Required

  • 1–5 years of experience in software engineering, applied AI, ML engineering, or related backend/platform roles
  • Strong Python skills
  • Strong production engineering fundamentals
  • Hands-on experience building AI/LLM applications, agents, retrieval-backed systems, or workflow automation
  • Experience working with tool-using systems requiring context quality, tool integration, and failure handling
  • Experience with AWS or GCP and deploying/debugging production AI services
  • Good judgment around eval design, failure modes, latency/cost tradeoffs, and safe rollout
  • Clear communication and ability to turn ambiguous technical workflows into robust product behavior

The Company
HQ: Santa Clara, CA
148 Employees
Year Founded: 2017

What We Do

Astera Labs Inc., a fabless semiconductor company headquartered in the heart of California’s Silicon Valley, is a leader in purpose-built connectivity solutions for data-centric systems throughout the data center. Partnering with leading processor vendors, cloud service providers, seasoned investors, and world-class manufacturing companies, Astera Labs is helping customers remove performance bottlenecks in data-intensive systems that are limiting the true potential of applications such as artificial intelligence and machine learning. The company’s product portfolio includes system-aware semiconductor integrated circuits, boards, and services to enable robust CXL, PCIe, and Ethernet connectivity.
