Senior Machine Learning Engineer (Large Systems)

Posted 17 Days Ago
3 Locations
In-Office
Senior level
Artificial Intelligence • Semiconductor
The Role
As a Senior Machine Learning Engineer, you will develop and optimize AI models for performance on specialized hardware, collaborate with teams, and engage with the AI community.
Summary Generated by Built In
About Graphcore

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute. 

It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry. 

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.  

Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation. 

Job Summary

As a Senior Machine Learning Engineer in the Applied AI team at Graphcore, you will contribute to advancing AI technology by developing and optimising AI models tailored to our specialised hardware. You will work on large-scale systems where performance is critical to the success of our projects. Working closely with the Software Development and Research teams, you will play a critical role in identifying opportunities to innovate and differentiate Graphcore’s technology. We seek engineers with strong technical skills and an understanding of AI model implementation at scale, eager to make a tangible impact in this rapidly evolving field.


The Team

The Applied AI team acts as a proxy for our customers: we need to understand the latest AI models, applications and software to ensure that Graphcore’s technology works seamlessly with the AI ecosystem and at scale. We build reference applications, contribute to key software libraries (for example, optimising kernels for efficiency on our hardware), and collaborate with the Research team to develop and publish novel ideas in domains such as efficient compute, model scaling, and distributed training and inference of AI models for multiple modalities and applications.
If you’re excited about advancing the next generation of AI models on cutting-edge hardware, we’d love to hear from you!


Responsibilities and Duties

  • Implement the latest machine learning models and optimise them for performance and accuracy, scaling to thousands of accelerators.
  • Test and evaluate new internal software releases, provide feedback to software engineering teams, make necessary code fixes, and conduct code reviews.
  • Benchmark models and key ML techniques to identify performance bottlenecks and improve model efficiency.
  • Design and conduct experiments on novel AI methods, implement them and evaluate results.
  • Collaborate with Research, Software, and Product teams to define, build, and test Graphcore’s next generation of AI hardware.
  • Engage with the AI community and keep up with the latest developments in AI.

  

Candidate Profile

Essential:

  • Bachelor’s, Master’s or PhD degree, or equivalent experience, in Machine Learning, Computer Science, Maths, Data Science, or a related field.
  • Proficiency in deep learning frameworks like PyTorch/JAX.
  • Strong Python or C++ software development skills.
  • Expertise in deep learning from model training to optimisation and evaluation.
  • Experience in distributed training or inference of ML models across 64+ accelerators.
  • Capable of designing, executing and reporting from ML experiments.
  • A deep understanding of performance bottlenecks and how to overcome them.
  • Ability to move quickly in a dynamic environment.
  • Enjoy cross-functional work collaborating with other teams.
  • Strong communicator - able to explain complex technical concepts to different audiences.

Desirable:

  • Experience in one or more of:
    • MLOps for Kubernetes-based clusters.
    • Building production systems with large language models.
    • Efficient computing based on low-precision arithmetic.
  • Experience writing C++/Triton/CUDA kernels for performance optimisation of ML models.
  • Familiarity with HPC systems and networking technologies, including InfiniBand, NVLink and RoCE.
  • Have contributed to open-source projects or published research papers in relevant fields.
  • Knowledge of cloud computing platforms.
  • Keen to present, publish and deliver talks in the AI community.

Benefits

In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar!

We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.

Applicants for this position must hold the right to work in the UK. Unfortunately, we are unable to provide visa sponsorship or support for visa applications at this time.

Top Skills

C++
CUDA
JAX
Python
PyTorch

The Company
HQ: Palo Alto, CA
389 Employees
Year Founded: 2016

What We Do

Graphcore has created a new processor, the Intelligence Processing Unit (IPU), specifically designed for artificial intelligence. The IPU’s unique architecture means developers can run current machine learning models orders of magnitude faster. More importantly, it lets AI researchers undertake entirely new types of work, not possible using current technologies, to drive the next great breakthroughs in general machine intelligence.

Our next-generation 3D Wafer-on-Wafer Bow IPU systems are helping AI innovators worldwide to build better, more innovative AI solutions, whether their focus is on language and vision, exploring graph neural networks and LSTMs, or creating something entirely new.

We believe our IPU technology will become the worldwide standard for artificial intelligence compute. The performance of Graphcore’s IPU is going to be transformative across all industries and sectors, whether you are a medical researcher, a roboticist or an engineer building autonomous cars.

Our team is at the forefront of the artificial intelligence revolution, enabling innovators from all industries and sectors to expand human potential with technology. What we do really makes a difference.

We’re always interested in hearing from exceptional people who want to join our team.
