Subject to alignment to the responsibilities and duties of the role
About GraphcoreAt Graphcore, we’re building the future of AI compute.
We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.
As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.
To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.
We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence
Job SummaryAs a Senior Software Engineer in the Collectives Simulator team, you will drive the design and development of a large-scale collective communication simulator that enables the analysis of network parameters and the efficient implementation of communication algorithms. The ideal candidate will have experience in designing, developing, and maintaining complex software systems involving custom hardware. You will be responsible for leading development efforts, mentoring junior engineers, and driving technical excellence in our projects.
The TeamThe Collectives Simulator team is responsible for building large-scale collective communication simulator for new AI hardware Graphcore is working on. The simulator allows users to in depth analyse communication algorithms in various network topologies.
Responsibilities and Duties- Designing, implementing, testing and documenting Collectives Simulator for new AI hardware
- Collaborating with other teams to design, implement and test new features
- Troubleshooting and resolving complex technical issues
- Ensuring seamless integration of new hardware with existing AI ecosystem
- Participating in agile development – working as part of a scrum team
- Extensive experience in software development using C++ programming language
- Experience with Python and C programming
- Excellent problem-solving skills and ability to debug and resolve complex issues
- Strong knowledge of multithreading and inter-process communication (IPC) techniques for development of efficient concurrent applications
- Experience with unit testing frameworks such as Boost.Test and Google Test
- Proficiency with build tools such as CMake, Make and Ninja
- Strong understanding of version control systems (preferred Git)
- Ability to work within a multinational team and with multinational customers
- Excellent written and verbal communication skills
- Experience in development of SW simulators
- Experience with RDMA networking libraries (for example libibverbs, libfabric)
- Knowledge of multithreading and parallel computing concepts, including experience with parallel algorithms and optimization for AI/ML and HPC systems
- Experience with Continuous Integration/Continuous Delivery (CI/CD) pipelines, including setting up automated workflows and deployments (for example GitHub Actions, GitLab CI)
- Experience with communication libraries (for example NCCL, MPI)
- Knowledge of machine learning frameworks (for example PyTorch)
- Knowledge of modern C++ standards 17/20
In addition to a competitive salary, Graphcore offers annual leave policy, medical and dental health plans, a gym card, and employee pension (matched up to 4%). We review our benefits on a yearly basis to ensure we offer a valuable and rewarding benefits programme to our employees. We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.
Top Skills
What We Do
Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.
Our next generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs or creating something entirely new.
We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors whether you are a medical researcher, roboticist or building autonomous cars.
Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do, really makes a difference.
We're always interested in hearing from exceptional people to join our team.







