Senior Systems Engineer, Workers AI

Posted Yesterday
Be an Early Applicant
2 Locations
Hybrid
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Helping Build a Better Internet
The Role
Design and build the AI inference infrastructure for Cloudflare, optimizing systems for high availability and performance while mentoring junior engineers.
Summary Generated by Built In
Available Locations: Austin, TX or London, UK (Hybrid) About the role
You'll design and build the core infrastructure that powers AI inference across Cloudflare's global network - real-time voice, frontier open LLMs, and customer-deployed models running on a heterogeneous fleet of GPUs and next-generation accelerators in hundreds of cities worldwide. Working alongside AI/ML engineers, hardware partners, and Cloudflare product teams, you'll solve hard problems in distributed systems and high-performance computing: sub-second model cold starts, multi-accelerator workload scheduling, efficient KV cache management, and a model deployment platform serving both Cloudflare and customers bringing their own models. We're building an AI inference platform embedded in the fabric of the internet - something that doesn't exist yet - and this role puts you at the center of it. We're looking for high-agency systems engineers who are energized by foundational infrastructure problems and want to define how AI runs at the edge of the network.
Role Responsibilities
  • Develop and maintain core components of the serverless inference platform to ensure high availability and scalability for Cloudflare users.
  • Optimize the model scheduling system to significantly increase efficiency and resource utilization across our inference infrastructure.
  • Implement improvements to the inference request routing logic to enhance overall performance and reduce latency for end-users.
  • Drive significant, measurable improvements in the platform's reliability and resilience by identifying and mitigating systemic risks.
  • Expand and refine the observability stack, including metrics, logging, and tracing, and fine-tune alerts to proactively identify and resolve production issues.
  • Lead complex, cross-functional technical projects from initial concept and design through final deployment and operationalization.
  • Act as a mentor to junior engineers and actively contribute to cultivating a strong, collaborative engineering culture within the team.
Role Requirements
Must-Have Skills
  • Experience in systems engineering, with a focus on distributed, high-performance systems.
  • Expert proficiency in Rust programming, particularly in an asynchronous environment.
  • Deep understanding and hands-on experience with relevant networking and application protocols (e.g., TCP, HTTP, WebSocket).
  • Experience with scaling and performance optimization techniques, including load balancing and caching in a distributed environment.

Nice-to-Have Skills
  • Demonstrable experience with container orchestration platforms, specifically Kubernetes and/or Nomad.
  • Familiarity with the challenges and architectures involved in large-scale inference serving (e.g., LLM and diffusion models).

Top Skills

HTTP
Kubernetes
Nomad
Rust
Tcp
Websocket
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
4,400 Employees
Year Founded: 2010

What We Do

Cloudflare, Inc. (NYSE: NET) is the leading connectivity cloud company on a mission to help build a better Internet. It empowers organizations to make their employees, applications and networks faster and more secure everywhere, while reducing complexity and cost. Cloudflare’s connectivity cloud delivers the most full-featured, unified platform of cloud-native products and developer tools, so any organization can gain the control they need to work, develop, and accelerate their business. Powered by one of the world’s largest and most interconnected networks, Cloudflare blocks billions of threats online for its customers every day. It is trusted by millions of organizations – from the largest brands to entrepreneurs and small businesses to nonprofits, humanitarian groups, and governments across the globe.

Why Work With Us

Cloudflare employees come from all walks of life. We are mission-driven, and our team is energized by a collaborative, creative environment that celebrates our differences and fosters new ways to grow together.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Cloudflare Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We are committed to developing a global team that is distributed with a flexible working approach. Doing this equitably and inclusively is essential to our success. Visit our careers site for more on 'How & Where We Work.'

Typical time on-site: Flexible
HQSan Francisco, CA
Singapore
Austin, TX
Bengaluru, Karnataka
Boston, MA
Champaign, IL
Denver, Colorado
Lisbon, PT
London, GB
Los Angeles, CA
New York, NY
Seattle, WA
Washington, DC
Learn more

Similar Jobs

Cloudflare Logo Cloudflare

Principal Systems Engineer

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
2 Locations
4400 Employees

Cloudflare Logo Cloudflare

Talent Analytics Intern (Summer 2026)

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
Austin, TX, USA
4400 Employees

Cloudflare Logo Cloudflare

Customer Advocacy Intern (Summer 2026)

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
2 Locations
4400 Employees
24-24 Hourly

Cloudflare Logo Cloudflare

Machine Learning Engineer

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
Austin, TX, USA
4400 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account