Architect/Staff Platform Integration Engineer

Reposted 21 Days Ago
Be an Early Applicant
London, Greater London, England, GBR
In-Office
195K-195K Annually
Expert/Leader
Artificial Intelligence • Semiconductor • Manufacturing
The Role
As Software Architect, you will shape OLIX's technical vision in hyperscale AI infrastructure, define architectural standards, and mentor senior engineers across the organization.
Summary Generated by Built In
About OLIX

AI is growing faster than any technology in history and the explosion in demand has created a massive infrastructure gap; we can no longer build chips or power stations fast enough to keep up. The industry is still leaning on a ten-year-old hardware blueprint that has reached its limit. A new paradigm that is faster and more efficient will be the biggest economic opportunity of the next century and create the most important company of the next decade. The OLIX Decode Accelerator 1 (DX-1) is the first accelerator architected specifically for decode. Rack-scale co-design of logic, data movement, packaging, optics and interconnect enables a step change in system level performance.

Role

As Software Platform Integration Engineer, you will be the technical authority on how OLIX serves large models as hyperscale AI infrastructure - spanning distributed inference engines, serving-runtime integration, KV cache and memory hierarchy, and the orchestration and networking layers that make serving real.

We are looking for experienced Architect, Staff and/or Principal-level engineers who have shipped distributed inference at scale and have strong opinions about how modern serving stacks - vLLM, SGLang, NVIDIA Dynamo - should be extended onto novel accelerators. You will partner closely with the Software Engineering Director, Platform Integration to set the technical direction for distributed inference on DX-1, define the architectural contracts the rest of the platform builds against, and make the hard technical calls across the serving stack. You bring rare depth across the full stack, the judgment to know what matters and why, and the influence to drive alignment across engineering without relying on authority.

Responsibilities
  • Shape the technical vision. Partnering with the Software Engineering Director to set long-term technical direction across serving-engine integration (vLLM, SGLang, NVIDIA Dynamo), disaggregated prefill/decode, KV cache management (NIXL / Mooncake TE), cluster orchestration, fleet management, networking, and deployment - and own the architectural integrity of that vision across the full platform lifecycle.

  • Translate strategy into architecture. Work with the Director and cross-functional partners to turn long-term business direction into concrete architectural priorities, and identify where technical investments will have the highest leverage

  • Set the architectural bar. Define the principles, interface contracts, and standards the organisation builds to - across scheduling, fleet operations, ingress/egress, and platform management - and ensure they hold across teams.

  • Make the hard calls. Own the technical decision-making across the platform stack: orchestration and scheduling architecture, fleet management systems, networking design, and deployment strategy.

  • Lead through influence. Drive alignment across teams without direct authority - through rigour, clarity, and the quality of your technical thinking.

  • Raise the technical ceiling. Mentor and stretch senior engineers across the organisation - not as a manager, but as a technical leader who holds the bar high and helps others reach it.

Skills & Experience
  • Deep expertise in distributed inference infrastructure (vLLM, SGLang, Nvidia Dynamo) as well as associated networking (NCCL, RoCE, Infiniband) and KV cache management (NIXL, Mooncake TE) technologies, and rail optimisation to link up accelerator clusters.

  • Deep expertise in cluster management at hyperscale on bare-metal, custom-accelerator fleets - provisioning, scheduling, and lifecycle ownership across thousands of nodes, including safe firmware update orchestration rolled out at fleet scale without compromising production SLOs.

  • Track record driving technical outcomes in high-reliability production inference environments: latency and throughput SLOs, capacity and cost modelling, observability, incident management, and security at scale across fleets of accelerators.

  • Full lifecycle experience from early architecture through to production operations and long-tail reliability.

  • Outstanding technical communicator. You articulate architectural decisions clearly to engineers, managers, and senior leadership alike, and write design thinking that becomes the organisational reference point.

Compensation & Equity
  • Competitive Salary, commensurate with your experience, skills, and location.

  • Equity & Ownership: Meaningful stock options. You’re not just joining the mission; you’re owning a piece of it.

  • Proximity Bonus: We value your time. To minimise your commute and maximise your life, we offer a £24k annual Living-Local Bonus if your residence is within 20 minutes of the office.

Health & Wellbeing

  • Premium Healthcare: Comprehensive BUPA medical and dental cover, including Medical History Disregarded (MHD), for complete peace of mind.

  • Time Off: 25 days of annual leave, plus all UK bank holidays.

The Workspace & Tech

  • Elite Hardware: M4 Macs come as standard, with M4 Pro upgrades for our engineering team. We will provide whatever you need to do your best work.

  • Optimal Environment: High-spec noise-cancelling headphones and a fully ergonomic workstation designed for deep focus.

  • Rapid Prototyping: Access to our high-performance 3D printing lab for work, experimentation, and personal creative projects.

Life at the Office

  • Chef-prepared meals: if you need to work late.

  • Caffeine on Us: We’ve got you covered with a tab at our favourite local coffee shop.

Relocation & Global Mobility

  • Visa Sponsorship: We hire the best in the world. We offer full UK and international visa sponsorship.

  • Seamless Relocation: Whether you’re moving across the country or across the globe, our dedicated relocation partner provides funding and concierge support to get you settled.

Due to U.S. export control regulations, candidates’ eligibility to work at OLIX depends on their most recent citizenship or permanent residency status. We are generally unable to consider applicants whose most recent citizenship or permanent residence is in certain restricted countries (currently including Iran, North Korea, Syria, Cuba, Russia, Belarus, China, Hong Kong, Macau, and Venezuela). Applicants who have subsequently obtained citizenship or permanent residency in another country not subject to these restrictions may still be eligible.

Skills Required

  • Proven track record delivering large-scale distributed infrastructure or platform architecture
  • Deep expertise in distributed systems, cluster orchestration, networking, and fleet operations
  • Track record driving technical outcomes in high-reliability production environments
  • Full lifecycle experience from early architecture through to production operations
  • Outstanding technical communicator
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
82 Employees
Year Founded: 2024

What We Do

The latest generation of AI models achieve breakthrough performance by using vastly more tokens to solve complex problems. As frontier models become more sophisticated, demand is compounding faster than today’s infrastructure can scale. Even the most dominant players, with full-stack control across silicon, software, and supply chains, are unable to solve this within the existing architecture. Inherent constraints in physical design and packaging mean a GPU-based approach is incapable of simultaneously delivering high throughput and high interactivity at low cost. Continuing AI’s advance and making it available to everyone requires a new compute paradigm. One that can overcome the fundamental limits of memory, energy, and speed that define today’s systems. If you like working on difficult and consequential problems, we want you at OLIX. We have offices in London, Austin, Toronto, San Francisco and Bristol. Check out our careers page at olix.com/careers

Similar Jobs

Atlassian Logo Atlassian

Senior Solution Engineer (German Speaking)

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
London, Greater London, England, GBR
11000 Employees

Cloudflare Logo Cloudflare

Project Manager

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
London, Greater London, England, GBR
4400 Employees

Nexthink Logo Nexthink

Field Marketing Intern

Artificial Intelligence • Big Data • Cloud • Information Technology • Machine Learning • Software
Remote or Hybrid
London, Greater London, England, GBR
1200 Employees

PwC Logo PwC

Salesforce CPQ/Revenue Cloud - Senior Associate

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Remote or Hybrid
57 Locations
370000 Employees
77K-202K Annually

Similar Companies Hiring

Amalgamated Sugar Thumbnail
Food • Greentech • Agriculture • Industrial • Manufacturing
Boise, Idaho
768 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account