Graphcore is a globally recognised leader in Artificial Intelligence computing systems. The company designs advanced semiconductors and data centre hardware that provide the specialised processing power needed to drive AI innovation, while delivering the efficiency required to support its broader adoption.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. We are opening a new AI Engineering Campus in Austin, which will play a central role in Graphcore's work building the future of AI computing.
Job Overview:Responsibilities:As a Performance Engineer, you will lead benchmarking, performance analysis, and system optimization across AI and HPC workloads on Arm-based architectures. You will collaborate with hardware architects, software developers, and customer engineering teams to enhance system efficiency and scalability, ensuring Arm technology delivers industry-leading datacenter solutions.
- Design, implement, and analyze performance experiments for AI training, inference, and HPC applications across distributed clusters.
- Develop tools and workflows to monitor, measure, and validate system and workload scalability.
- Partner with system architects and software teams to identify bottlenecks and propose optimizations across the hardware/software stack.
- Lead performance bring-up and validation of new hardware platforms, interconnects, and accelerators.
- Collaborate with customers and Tier-1 partners to provide guidance on performance tuning and cluster-level deployment strategies.
- Drive innovation in performance methodology, including predictive modeling, profiling frameworks, and benchmark development.
- Present findings to engineering leadership, customers, and partners to influence architectural and design decisions.
- Demonstrated ability in HPC and AI performance engineering, with proven hands-on expertise in distributed systems.
- Solid understanding of CPU/GPU/accelerator performance analysis, workload profiling, and scalability optimization.
- Proven experience with ARM64, x86, and GPU architectures in large-scale datacenter environments.
- Proficiency in performance tools such as VTune, Nsight, Rocprof, Pytorch profiler, MPI/OpenMP profilers, Cray/Allinea tools.
- Strong programming skills in Python, C/C++, Fortran, CUDA, and parallel frameworks (MPI, OpenMP, SYCL).
- Experience with large AI frameworks (PyTorch, TensorRT, Megatron-LM, vLLM, SGLang, TorchTitan).
- Familiarity with distributed training at scale (multi-node, multi-GPU clusters).
- Excellent communication skills and experience working with cross-functional engineering teams.
- Experience with datacenter-scale benchmarking and system acceptance testing.
- Knowledge of interconnect fabrics (Infiniband, Slingshot, Omni-Path, RoCE, EFA) and distributed storage systems (Lustre, GPFS, Weka).
- Hands-on background with cloud HPC/AI deployments (AWS, Azure, GCP).
- Familiarity with containerization and orchestration (Docker, Kubernetes, SLURM, PBS).
- Background in exascale or pre-exascale performance co-design projects.
- Strong publication record in HPC/AI performance analysis.
- Experience leading small teams or cross-company performance projects.
- Be part of a groundbreaking team influencing the next generation of data center systems.
- Collaborate with premier engineers and vendors to develop industry-leading AI hardware.
- Drive innovation in performance methodology with global impact.
- Access professional growth through sophisticated project involvement and multidisciplinary teamwork.
- Join a company committed to diversity and inclusion, where your work matters and drives global progress.
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email [email protected]. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Equal Opportunities at ArmArm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Hybrid Working at ArmArm’s hybrid approach to working is centred around flexibility, where we split our time between the office and other locations to get our work done. Within that framework, we empower groups and teams to determine their own particular hybrid working pattern, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Top Skills
What We Do
At Graphcore, we’re building the future of AI compute.
We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.
As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.
To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.
We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.
Why Work With Us
Our team is at the forefront of the machine intelligence revolution, enabling innovators from all industries to build AI-native products to expand human potential. What we do at Graphcore really makes a difference.
Gallery
Graphcore Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
At Graphcore, we value wellbeing and flexibility to support a healthy work/life balance. Our hybrid approach encourages office-based colleagues to work onsite three days a week, with trusted flexibility built on trust and transparency for everyone.