Senior Principal Performance Engineering

Posted 5 Days Ago
Be an Early Applicant
Austin, TX
Hybrid
Senior level
Artificial Intelligence • Semiconductor
Joining Graphcore gives you a seat at the top-table, shaping the future of Artificial Intelligence.
The Role
Lead benchmarking, performance analysis, and system optimization for AI and HPC workloads on Arm-based architectures. Design experiments, build monitoring tools, identify bottlenecks across HW/SW stack, validate new platforms, and collaborate with partners to tune distributed training and influence architecture.
Summary Generated by Built In

Graphcore is a globally recognised leader in Artificial Intelligence computing systems. The company designs advanced semiconductors and data centre hardware that provide the specialised processing power needed to drive AI innovation, while delivering the efficiency required to support its broader adoption.   

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. We are opening a new AI Engineering Campus in Austin, which will play a central role in Graphcore's work building the future of AI computing. 

Job Overview:Responsibilities:

As a Performance Engineer, you will lead benchmarking, performance analysis, and system optimization across AI and HPC workloads on Arm-based architectures. You will collaborate with hardware architects, software developers, and customer engineering teams to enhance system efficiency and scalability, ensuring Arm technology delivers industry-leading datacenter solutions.

  • Design, implement, and analyze performance experiments for AI training, inference, and HPC applications across distributed clusters.
  • Develop tools and workflows to monitor, measure, and validate system and workload scalability.
  • Partner with system architects and software teams to identify bottlenecks and propose optimizations across the hardware/software stack.
  • Lead performance bring-up and validation of new hardware platforms, interconnects, and accelerators.
  • Collaborate with customers and Tier-1 partners to provide guidance on performance tuning and cluster-level deployment strategies.
  • Drive innovation in performance methodology, including predictive modeling, profiling frameworks, and benchmark development.
  • Present findings to engineering leadership, customers, and partners to influence architectural and design decisions.
Required Skills and Experience:
  • Demonstrated ability in HPC and AI performance engineering, with proven hands-on expertise in distributed systems.
  • Solid understanding of CPU/GPU/accelerator performance analysis, workload profiling, and scalability optimization.
  • Proven experience with ARM64, x86, and GPU architectures in large-scale datacenter environments.
  • Proficiency in performance tools such as VTune, Nsight, Rocprof, Pytorch profiler, MPI/OpenMP profilers, Cray/Allinea tools.
  • Strong programming skills in Python, C/C++, Fortran, CUDA, and parallel frameworks (MPI, OpenMP, SYCL).
  • Experience with large AI frameworks (PyTorch, TensorRT, Megatron-LM, vLLM, SGLang, TorchTitan).
  • Familiarity with distributed training at scale (multi-node, multi-GPU clusters).
  • Excellent communication skills and experience working with cross-functional engineering teams.
“Nice To Have” Skills and Experience :
  • Experience with datacenter-scale benchmarking and system acceptance testing.
  • Knowledge of interconnect fabrics (Infiniband, Slingshot, Omni-Path, RoCE, EFA) and distributed storage systems (Lustre, GPFS, Weka).
  • Hands-on background with cloud HPC/AI deployments (AWS, Azure, GCP).
  • Familiarity with containerization and orchestration (Docker, Kubernetes, SLURM, PBS).
  • Background in exascale or pre-exascale performance co-design projects.
  • Strong publication record in HPC/AI performance analysis.
  • Experience leading small teams or cross-company performance projects.
In Return:
  • Be part of a groundbreaking team influencing the next generation of data center systems.
  • Collaborate with premier engineers and vendors to develop industry-leading AI hardware.
  • Drive innovation in performance methodology with global impact.
  • Access professional growth through sophisticated project involvement and multidisciplinary teamwork.
  • Join a company committed to diversity and inclusion, where your work matters and drives global progress.
Accommodations at Arm

At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email [email protected]. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.

Equal Opportunities at Arm

Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Hybrid Working at Arm

Arm’s hybrid approach to working is centred around flexibility, where we split our time between the office and other locations to get our work done. Within that framework, we empower groups and teams to determine their own particular hybrid working pattern, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.


Top Skills

Arm64,X86,Gpu,Vtune,Nsight,Rocprof,Pytorch Profiler,Mpi Profiler,Openmp Profiler,Cray/Allinea Tools,Python,C/C++,Fortran,Cuda,Mpi,Openmp,Sycl,Pytorch,Tensorrt,Megatron-Lm,Vllm,Sglang,Torchtitan
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Bristol
488 Employees
Year Founded: 2016

What We Do

At Graphcore, we’re building the future of AI compute.

We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.

As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.

To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.

We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.

Why Work With Us

Our team is at the forefront of the machine intelligence revolution, enabling innovators from all industries to build AI-native products to expand human potential. What we do at Graphcore really makes a difference.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Graphcore Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

At Graphcore, we value wellbeing and flexibility to support a healthy work/life balance. Our hybrid approach encourages office-based colleagues to work onsite three days a week, with trusted flexibility built on trust and transparency for everyone.

Typical time on-site: 3 days a week
HQHeadquarters
Austin Office
Bengaluru Office
Cambridge Office
Gdańsk Office
Hsinchu Office
London Office
Learn more

Similar Jobs

Graphcore Logo Graphcore

Principal Embedded SW/FW Engineer (Bringup)

Artificial Intelligence • Semiconductor
Hybrid
Austin, TX, USA
488 Employees
241K-326K Annually

Graphcore Logo Graphcore

Staff Engineering Program Support

Artificial Intelligence • Semiconductor
Hybrid
Austin, TX, USA
488 Employees

Graphcore Logo Graphcore

Intern, Knowledge Management & Training Projects

Artificial Intelligence • Semiconductor
Hybrid
Austin, TX, USA
488 Employees

Graphcore Logo Graphcore

Principal Engineer

Artificial Intelligence • Semiconductor
Hybrid
Austin, TX, USA
488 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account