We are building a central AI control plane to support the safe, scalable, and efficient use of AI across Arm's engineering teams.
This hands-on platform engineering role will help deliver and own the runtime platforms that make AI services reliable, secure, observable and supportable at Arm scale. You will work across Kubernetes, cloud, identity, secrets, networking, telemetry, incident management and automation to provide the production foundation for Arm's AI platform.
You will initially contribute in three key areas:
Production AI runtime platforms:
- Build, deploy and operate the infrastructure for centrally hosted AI platform services, including MCP server infrastructure, model gateway services and supporting control-plane components.
- Design runtime patterns for isolation, scalability, secure execution, capacity management and cost-aware operation.
- Automate provisioning, configuration, upgrades and lifecycle management using infrastructure-as-code and GitOps patterns.
Reliability, observability and support:
- Define and implement service-level indicators, service-level objectives, alerting, dashboards, runbooks and support workflows.
- Handle incident response, post-incident review and vendor outages for AI services embedded in engineering workflows.
- Build telemetry that helps Arm understand AI platform health, usage, performance, cost and operational risk.
Secure operations by default:
- Ensure platform components meet production readiness, security and compliance expectations.
- Help make secure AI usage the default by providing reliable paved paths rather than manual or fragmented infrastructure.
Responsibilities:
- Build, operate and continuously improve the production infrastructure for Arm's AI platform services, using both custom and third-party solutions.
- Own reliability, scalability, monitoring, alerting, incident response, runbooks and operational readiness for AI platform components.
- Develop automation for provisioning, deployment, configuration, backup, recovery, patching, upgrades and lifecycle management.
- Implement secure runtime patterns, including workload isolation, secrets management, identity integration, network controls and auditability.
- Participate in production support and on-call arrangements.
Required Skills and Experience:
- Familiarity with AI platform concepts such as model routing, MCP servers, agentic workflows, RAG systems or LLM observability.
- Strong experience operating production infrastructure or platform services in a Linux-based environment using Kubernetes, containers, cloud or private-cloud platforms, infrastructure-as-code and CI/CD/GitOps tooling.
- Strong automation and scripting skills, for example Terraform, Go, Python or similar.
- Experience with incident management, problem management, demand forecasting, and production readiness practices.
- Good understanding of security fundamentals for production platforms, including identity, secrets, access control, network segmentation, vulnerability management and audit logging.
"Nice To Have" Skills and Experience:
- Experience operating internal developer platforms, AI platforms, model gateways, MCP infrastructure or other shared engineering platforms.
- Experience with service mesh, policy-as-code, workload identity, sandboxing, secure runtime environments or multi-tenant platform designs.
- Experience with regulated or security-sensitive engineering environments.
In Return:
This is an opportunity to help shape a new AI Platform capability for Arm Engineering, working on services that enable thousands of engineers to use AI safely, productively and at scale.
You will work in a high-impact platform engineering environment, collaborating with highly technical peers across Arm from Engineering, IT, Security and Architecture.
Accommodations at Arm:
At Arm, we want our people to Do Great Things. If you need support or an accommodation to Be Your Brilliant Self during the recruitment process, please email [email protected]. Please note that by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Equal Opportunities at Arm:
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don't discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Hybrid Working at Arm:
Arm's hybrid approach to working is centered around flexibility, where we split our time between the office and other locations to get our work done. Within that framework, we empower groups and teams to determine their own particular hybrid working pattern, depending on the work and the team's needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Salary Range:
$161,500-$218,500 per year
We value people as individuals and are dedicated to rewarding them competitively and equitably for the work they do and the skills and experience they bring to Arm. Salary is only one component of Arm's offering. The total reward package will be shared with candidates during the recruitment and selection process.
Hybrid Working at Arm
Arm's approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team's needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Skills Required
- Experience operating production infrastructure in a Linux-based environment using Kubernetes
- Strong automation and scripting skills, e.g. Terraform, Go, Python
- Experience with incident management and production readiness practices
- Good understanding of security fundamentals for production platforms
- Familiarity with AI platform concepts and services
What We Do
We bring brilliant people together in a global ecosystem that is sparking the world’s potential. Arm technology enables specialized processing built on the economics, design freedom and accessibility of general-purpose compute that has, so far, led to more than 180 billion chips being shipped by our partners.
Why Work With Us
At Arm, we build the future of computing, powering everything from smartphones to AI. Our 10x mindset drives bold thinking and deep collaboration to solve complex problems together. With a people-first culture, flexible work, and strong support for growth and wellbeing, your ideas can make a global impact while your career thrives.
Hybrid Workspace
Employees engage in a combination of remote and on-site work.