Staff Software Engineer, AI Runtime

Posted 2 Days Ago
Hiring Remotely in United States
Remote
185K-215K Annually
Senior level
Artificial Intelligence • Big Data • Information Technology • Professional Services • Software
Apollo makes application development easier, better, and accessible to more people.
The Role
As a Staff Software Engineer, you will architect and scale AI/MCP Server and Gateway for multi-agent workflows, ensuring reliability and performance while collaborating with cross-functional teams.
Summary Generated by Built In

We’re seeking a Staff Software Engineer to help power the future of agentic AI workflows. You’ll take our MCP Server to the next level, turning it into an enterprise-grade service that lets diverse tools and systems be exposed effortlessly to AI agents. Looking ahead, you’ll also help architect the MCP Gateway—a new layer that will route requests across tools, enforce policies, and provide the runtime foundation for scalable multi-agent systems. Along the way, you’ll tackle challenges in scalability, performance, and developer experience to ensure our platform feels seamless, powerful, and enterprise-ready.


About the Team

The Graph DX AI Runtime Team builds and maintains the MCP Server and Gateway—the backbone of agent-to-tool communication and the routing layer that keeps everything flowing. We make it simple for developers to wire up agents, orchestrate workflows, and scale interactions reliably. Our focus is on speed, security, and seamless integration, so teams can spend less time managing infrastructure and more time building intelligent experiences.

What You'll Do

  • Architect and scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries.

  • Design and implement robust server infrastructure to ensure reliability, performance, and security at scale.

  • Build and maintain tools for agent discovery, communication, and coordination.

  • Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead.

  • Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration.

  • Integrate observability, logging, and monitoring for full visibility into server and agent behavior.

  • Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions.

  • Collaborate with teams across Apollo to ensure the MCP Server meets evolving product and developer needs.


Technical Challenges You'll Tackle

  • Architect and scale the MCP Gateway—Apollo’s routing layer for agentic workflows—ensuring tools and services can be discovered, invoked, and orchestrated reliably across diverse environments.

  • Design and implement high-performance routing infrastructure with reliability, scalability, and security at its core.

  • Build and maintain routing patterns and coordination mechanisms that let agents interact with the right tools at the right time.

  • Define deployment strategies and runtime optimizations to minimize latency and operational overhead.

  • Explore and implement AI-driven routing strategies to optimize context retrieval, reduce cost, and improve decision accuracy.

  • Collaborate with teams across Apollo to ensure the MCP Server and Gateway integrate seamlessly with Apollo’s control plane for AI tools.

  • Integrate observability and monitoring into the routing layer to provide full visibility into traffic flows, tool availability, and agent interactions.

Who You Are

  • Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems.

  • Deep expertise in Rust programming language

  • Strong background in distributed systems, server architecture, and high-performance backend development.

  • Proven experience with protocol design, message routing, and server-side orchestration frameworks.

  • Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions.

  • Proven experience with protocol design, message routing, and building server-side frameworks that enable scalable, reliable multi-tool agent workflows.

  • Hands-on experience with observability, monitoring, and debugging frameworks for complex systems.

  • Passion for clean, maintainable code, high system reliability, and scalable architecture.

  • Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability.

  • Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams.

  • Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives.

  • Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems.

Bonus Points

  • Exposure to AI/ML-enabled developer tooling or autonomous system orchestration.

  • Familiarity with cloud-native architectures, containerization, or orchestration frameworks.

  • Experience with performance optimization and cost-efficient scaling of high-throughput distributed systems.


At Apollo, we strive to provide competitive, market-informed compensation whilst ensuring consistency within the team in each country. We make hiring decisions based on your skills, experience, and our overall assessment of what we learned during the hiring process. In addition to the U.S. base salary range, we also provide equity and benefits.


Apollo offers all U.S. employees a choice of 3 Anthem Blue Cross medical plans and California residents can also choose from an additional 2 Kaiser medical plans. Dental and Vision benefits are provided by Sun Life Financial.


Location: This is a remote position that can be done from anywhere in the United States or Canada.


Equal Opportunity: Apollo is proud to be an equal opportunity workplace dedicated to pursuing and hiring a talented and diverse workforce.


Privacy: California residents applying for positions at Apollo can see our privacy policy here.


E-Verify: Apollo is an E-Verify employer and will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S. For more information, please visit E-Verify.

Top Skills

AI
Distributed Systems
High-Performance Backend Development
Rust
Server Architecture
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
280 Employees
Year Founded: 2016

What We Do

Apollo GraphQL is powering the future of modern development, enabling businesses to accelerate product velocity and platform innovation with the power of the supergraph. Apollo is the most widespread implementation of GraphQL. The company is focused on using its market and technical leadership to make application development easier, better, and faster for everyone by combining APIs, databases, and microservices into a supergraph that can effectively be queried by GraphQL.

Why Work With Us

We’re the company responsible for bringing GraphQL to the global community. We are scaling to ensure that we have the wherewithal to keep our promises to our subscribers, community and GraphQL enthusiasts. Our revenue is going to more than 2x in 2022 and we’re planning on adding 100 new Apollonauts by the end of 2022.

Gallery

Gallery

Similar Jobs

Cloudflare Logo Cloudflare

Sales Director, US Majors Heartland

Cloud • Information Technology • Security • Software • Cybersecurity
Remote or Hybrid
United States
4400 Employees

Kalshi Logo Kalshi

Accountant

Fintech • Payments • Financial Services
Easy Apply
In-Office or Remote
2 Locations
203 Employees
100K-180K Annually

Cox Enterprises Logo Cox Enterprises

Search Engine Optimization Specialist

Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
21-32 Hourly
Remote or Hybrid
Louisville, CO, USA
256 Employees
72K-119K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account