Senior Software Engineer

Sorry, this job was removed at 07:25 p.m. (CST) on Monday, Jun 23, 2025
Gdańsk, Województwo pomorskie
In-Office
Artificial Intelligence • Semiconductor
The Role

About Graphcore 

How often do you get the chance to build a technology that transforms the future of humanity? Graphcore products have set the standard in made-for-AI compute hardware and software, gaining global attention and industry acclaim. Now we are developing the next generation of artificial intelligence compute with systems that will allow AI researchers to develop more sophisticated models, help scientists unlock exciting new discoveries, and power companies around the world as they put AI at the heart of their business. We recently joined SoftBank Group, bringing large and ongoing investment from one of the world’s leading backers of innovative AI companies.


Job Summary

As a Senior Software Engineer in the ML Software Performance Validation team, you will play a critical role in ensuring end-to-end performance excellence of our proprietary AI hardware and software stack. You will report directly to the Performance Validation Team Lead and collaborate closely with component teams, including ML Framework developers, Compiler and Runtime teams, Infrastructure engineers, and Product Management. Your work will directly influence the efficiency and scalability of our ML software solutions, enabling reliable and performant AI solutions for our customers.


The Team

The ML Software Performance Validation team is part of the broader ML Software Engineering organisation, responsible for validating and optimizing the performance of our proprietary ML solutions. Our team comprises experienced engineers and specialists dedicated to rigorous performance benchmarking, analysis, and optimization across large-scale distributed systems. We collaborate closely with internal stakeholders to ensure our products meet the highest standards of efficiency and scalability.


Responsibilities and Duties

  • Develop and maintain automated benchmarking and performance validation frameworks and models for ML software stacks.
  • Analyse performance bottlenecks at scale (single-node, multi-node, and multi-rack) and recommend actionable improvements.
  • Collaborate with ML framework, compiler, and distributed computing teams to validate and enhance software optimizations.
  • Implement performance monitoring, profiling, and tracing tools tailored for ML workloads.
  • Perform systematic scalability testing (scale-up and scale-out) and document findings clearly.
  • Design, automate, and execute comprehensive test plans to validate software performance against defined goals.
  • Lead deep-dive debugging sessions and root-cause analysis of performance issues, and coordinate resolution activities.
  • Document performance validation processes and best practices.

Candidate Profile 

Essential:

  • A passion for your work and the ability to thrive in uncertain and complex environments.
  • Hands-on experience with ML software stacks, particularly PyTorch or similar frameworks.
  • Solid programming skills in Python and proficiency with performance debugging and profiling tools (perf, VTune, TensorBoard, or similar).
  • Good knowledge of distributed computing concepts, collective communication algorithms, and their impact on ML workload performance.
  • Demonstrated ability to analyse complex performance data and communicate findings effectively to technical and non-technical stakeholders.
  • Strong problem-solving skills, with the ability to systematically debug complex software and infrastructure issues.

Desirable:

  • Expertise in software performance analysis, profiling, and benchmarking, particularly with large-scale distributed systems.
  • Familiarity with container technologies (Docker, Kubernetes) and their performance implications.
  • Experience with high-performance computing (HPC) clusters and networking technologies (InfiniBand, RDMA).
  • Prior experience with precision timing protocols (NTP, PTP) and time synchronization for performance benchmarking.
  • Knowledge of compiler internals, intermediate representations (IR), and hardware accelerators.
  • Experience working with custom AI/ML hardware accelerators or GPUs.

Benefits

In addition to a competitive salary, Graphcore offers an annual leave policy, medical and dental health plans, a gym card, and an employee pension (matched up to 4%). We review our benefits on a yearly basis to ensure we offer a valuable and rewarding benefits programme to our employees. We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interviews and encourage you to chat to us if you require any reasonable adjustments.



The Company
HQ: Palo Alto, CA
389 Employees
Year Founded: 2016

What We Do

Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.

Our next-generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs, or creating something entirely new.

We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors, whether you are a medical researcher, a roboticist, or building autonomous cars.

Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do really makes a difference.

We're always interested in hearing from exceptional people who want to join our team.
