Staff Software Engineer, Agentic Platform

Reposted 12 Days Ago
Hiring Remotely in Seattle, WA, USA
In-Office or Remote
170K-276K Annually
Senior level
Information Technology
Docker helps developers bring their ideas to reality by conquering the complexity of app development.
The Role
Join Docker's Agentic Platform team to build and operate AI-driven workflows, focusing on infrastructure, orchestration, and technical leadership. Role involves design, operation, and continuous improvement of agent execution runtime and cloud services.
Summary Generated by Built In

Docker has been one of the most loved brands in developer tooling, trusted by more than 20 million monthly users and over 20 billion container image pulls. From solo founders to the world's largest companies, developers rely on Docker to build, share, and run their applications across our suite of products including Docker Desktop, Docker Hub, and Docker Scout.
We are a globally distributed, remote-first team building the tools that define how software gets built and delivered. As AI agents redefine software development, Docker is at the center of that shift, providing the sandboxed environments, verified images, and secure infrastructure that make autonomous workflows trustworthy by default.

Join Docker's Agentic Platform team to build the foundational infrastructure powering the next generation of AI-driven workflows. Intelligent agents are rapidly becoming the primary interface between developers and complex systems and we're building the platform that makes them reliable, scalable, and observable at production scale.

You'll be working on the core agent execution runtime, orchestration primitives, and the cloud infrastructure that keeps the Agentic Platform running 24/7. This is a high-ownership role: you won't just build systems, you'll run them, respond when they fail, and drive continuous improvement across the stack.

This is a greenfield opportunity to shape how agents are built and operated at scale. You'll work alongside seasoned engineers, collaborating with partner teams across AI infrastructure, developer experience, and platform reliability.

Please note: for this role, we are prioritizing candidates who currently live in Seattle, WA Metro Area.

Responsibilities/What you'll work on:

Agent Workflow & Orchestration
  • Design and operate the core agent execution runtime responsible for scheduling, state management, and lifecycle management of long-running agentic workflows

  • Build robust multi-agent coordination patterns: task handoff, agent memory (short-term and long-term), tool use, and workflow branching at scale

  • Develop context window management strategies and session persistence layers for stateful agent interactions

  • Build tooling for prompt engineering as a first-class engineering discipline — versioning, testing, and evaluation of prompts at scale

  • Build platform capabilities that support developers working in AI-assisted coding workflows, including IDE integrations, local-first development environments, and fast iteration loops

Cloud Infrastructure & Service Ownership
  • Own and operate Agentic Platform services in AWS or OCI infrastructure provisioning, scaling, cost management, and reliability

  • Provision and manage cloud infrastructure using Terraform; manage Kubernetes application packaging and deployment with Helm

  • Participate in the 24/7 on-call rotation

  • This role may require participation in a 24/7 on-call rotation for the Agentic Platform; carry genuine pager responsibility for the services you build and operate

  • Define and uphold SLOs; lead incident response, blameless post-mortems, and drive continuous reliability improvements

  • Instrument systems for observability: distributed tracing, structured logging, metrics dashboards, and alerting

Technical Leadership
  • As a Staff Engineer, partner with engineering leadership to set technical direction and serve as a guide and mentor as the team grows

  • Drive architectural decisions that balance velocity with long-term maintainability across a distributed, cloud-native stack

  • Collaborate cross-functionally with product managers, designers, and partner engineering teams to integrate agentic capabilities into the broader developer platform

  • Contribute to a culture of engineering excellence through design reviews, RFC processes, and mentorship

Qualifications for this role

Required:
  • 12+ years of professional, hands-on, full-time software engineering experience in backend, infrastructure, or platform engineering.

  • Cloud Platform Expertise (AWS/OCI/Azure/GCP): Proven, hands-on experience operating production services in AWS or Oracle Cloud Infrastructure compute, networking, managed services, IAM, and cost management. This is a must-have; the Agentic Platform is a cloud-native service running 24/7.

  • Service Ownership in a Cloud Setting: You have owned production services end-to-end — on-call, incident response, SLO definition, and post-mortems. You don't just build; you run what you build.

  • Distributed Systems Design: Deep understanding of fault tolerance, consistency, observability, and scalability in cloud-native environments

  • Backend Engineering Proficiency: Strong proficiency in at least one backend language used for systems work — Go, Python, Rust, or Java

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience

Strongly Preferred:
  • Go: Professional proficiency in Go — Docker's primary language for backend systems

  • Infrastructure as Code: Experience with Terraform for cloud infrastructure provisioning and Helm for Kubernetes application packaging and deployment

  • Data Infrastructure: Experience with PostgreSQL and Redis / Pub-Sub patterns for state management, caching, and event-driven agent workflows

  • MCP & Agent Tooling: Experience with MCP (Model Context Protocol) server design and integration

  • Container & Orchestration: Docker, Kubernetes, or equivalent — especially in the context of agent sandboxing and secure code execution environments

  • AI-assisted development tools: Familiarity with Cursor, Claude Code, Copilot, Windsurf, etc. and the developer personas using them

  • Agent Evaluation: Experience with LLM-as-judge frameworks, behavioral regression testing, and golden dataset management

  • Agent Systems Experience: Hands-on experience building or operating AI agent systems — including multi-agent orchestration, tool use, memory systems, or agent evaluation frameworks

  • Open Source: Contributions or community engagement on relevant open source projects

Docker considers visa sponsorship on a case-by-case basis based on business needs.

Perks

  • Freedom & flexibility; fit your work around your life

  • Designated quarterly Whaleness Days plus end of year Whaleness break

  • Home office setup; we want you comfortable while you work

  • 16 weeks of paid Parental leave (after 6 months of employment)

  • Technology stipend equivalent to $100 USD net/month

  • PTO plan that encourages you to take time to do the things you enjoy

  • Training stipend for conferences, courses and classes

  • Equity; we are a growing start-up and want all employees to have a share in the success of the company

  • Docker Swag

  • Medical benefits, retirement and holidays vary by country

  • Remote-first culture, with offices in Seattle and Paris

Docker embraces diversity and equal opportunity. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our company will be.

#LI-REMOTE

Skills Required

  • 8+ years of professional software engineering experience
  • Proven experience operating production services in AWS or Oracle Cloud
  • Owned production services end-to-end with incident response experience
  • Deep understanding of distributed systems design
  • Strong proficiency in at least one backend language for systems work
  • Bachelor's degree in Computer Science or related field
  • Professional proficiency in Go
  • Experience with Terraform and Helm
  • Experience with PostgreSQL and Redis

Docker, Inc Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Docker, Inc and has not been reviewed or approved by Docker, Inc.

  • Healthcare Strength Healthcare coverage is described as comprehensive, including employer-paid medical, dental, and vision for employees and dependents in the U.S. Additional resources such as telehealth, mental-health support, and an HRA for deductibles are highlighted.
  • Flexible Benefits Remote-first support includes a home office setup budget, monthly technology and coworking stipends, and async/time-zone flexibility. These elements indicate adaptability to distributed work.
  • Leave & Time Off Breadth Time off programs include flexible PTO, companywide wellness days, and a year-end recharge period. Paid parental leave is also offered following an eligibility period.

Docker, Inc Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
498 Employees
Year Founded: 2013

What We Do

At Docker, we simplify the lives of developers who are making world-changing apps. We simplify and accelerate workflows with an integrated development pipeline and application components. Actively used by millions of developers around the world, Docker Desktop and Docker Hub provide unmatched simplicity, agility and choice.

Why Work With Us

We are a people-first organization that provides every employee an opportunity to grow and learn. We provide regular development opportunities for all employees helping employees achieve their goals.

Gallery

Gallery

Similar Jobs

Easy Apply
Remote
United States
900 Employees
31-35 Hourly

Cohere Health Logo Cohere Health

Program Manager

Healthtech • Software
Easy Apply
Remote
United States
900 Employees
110K-125K Annually
Easy Apply
Remote
United States
900 Employees
260K-280K Annually

People Inc. Logo People Inc.

Senior Software Engineer

AdTech • Consumer Web • Digital Media • eCommerce • Marketing Tech
Remote or Hybrid
US
3500 Employees
160K-195K Annually

Similar Companies Hiring

Scrunch  Thumbnail
Artificial Intelligence • Information Technology • Marketing Tech • Software • SEO
Salt Lake City, Utah
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account