Senior Machine Learning Platform Engineer

Reposted 3 Days Ago
Be an Early Applicant
32 Locations
In-Office
100K-100K Annually
Mid level
Artificial Intelligence
The Role
The Machine Learning Platform Engineer will manage cloud infrastructure, support AI researchers, and deploy models in production, focusing on MLOps and CI/CD practices.
Summary Generated by Built In
Welcome to the video first world

From your everyday PowerPoint presentations to Hollywood movies, AI will transform the way we create and consume content. Today, people want to watch and listen, not read — both at home and at work. If you’re reading this and nodding, check out our brand video.

Despite the clear preference for video, communication and knowledge sharing in the business environment are still dominated by text, largely because high-quality video production remains complex and challenging to scale—until now….

Meet Synthesia

We're on a mission to make video easy for everyone. Born in an AI lab, our AI video communications platform simplifies the entire video production process, making it easy for everyone, regardless of skill level, to create, collaborate, and share high-quality videos. Whether it's for delivering essential training to employees and customers or marketing products and services, Synthesia enables large organizations to communicate and share knowledge through video quickly and efficiently. We’re trusted by leading brands such as Heineken, Zoom, Xerox, McDonald’s and more. Read stories from happy customers and what 1,200+ people say on G2.

In February 2024, G2 named us as the fastest growing company in the world. Today, we're at a $2.1bn valuation and we recently raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook.

About the role

We’re looking for an experienced Machine Learning Platform Engineer to join our MLOps team at Synthesia. MLOps a group that enables our AI researchers and engineers to build, train, serve, and deploy state-of-the-art generative models at scale.

You’ll own critical infrastructure across both research and production, helping bridge our DevOps and MLOps domains. You can expect to work across cloud infrastructure, CI/CD pipelines, observability, and tooling, with autonomy to identify and fix bottlenecks in a fast-moving AI company.

This is a hands-on senior IC role (roughly level 5 scope). You’ll be joining a growing team that’s shifting from enablement to direct execution, and you’ll help shape how we scale our infrastructure over the next year.

What you'll do 
  • Manage and evolve our AWS (and some GCP) cloud environments, balancing reliability, cost, and velocity.

  • Maintain and scale Kubernetes (EKS) clusters — managing workloads, deployments, and monitoring at production scale.

  • Own and improve our CI/CD systems (GitHub Actions on our self-hosted AWS runners).

  • Define and implement Infrastructure as Code using Terraform and Terragrunt.

  • Strengthen observability via Datadog and enable teams to understand their systems in production.

  • Collaborate with AI researchers to deploy and monitor ML models — no prior ML experience required.

  • Drive FinOps practices: vendor management, cost allocation, and financial feedback loops.

  • Contribute to internal tooling, automation, and reporting platforms that improve developer experience.

You’ll thrive in this role if you have: 
  • Deep hands-on DevOps / SRE / Platform experience in a SaaS or high-traffic product environment.

  • Strong Kubernetes experience - spinning up and managing clusters, not just consuming them.

  • Proven AWS and or GCP expertise. 

  • Proficiency with Terraform / Terragrunt, Linux, and Python scripting.

  • Strong understanding of CI/CD design patterns.

  • Experience with Datadog or similar observability tooling.

  • Comfortable operating autonomously in ambiguous environments.

  • A pragmatic mindset - focusing on scalable, maintainable solutions over theoretical perfection.

  • A bias toward execution and written communication, especially in remote contexts.

Bonus points for:

  • Familiarity with Temporal.io, or workflow orchestration frameworks.

  • Light frontend or tooling development experience (React, Node.js).

  • Previous work supporting AI research or data-intensive environments

Our culture

At Synthesia we’re passionate about building, not talking, planning or politicising. We strive to hire the smartest, kindest and most unrelenting people and let them do their best work without distractions. Our work principles serve as our charter for how we make decisions, give feedback and structure our work to empower everyone to go as fast as possible. You can find out more about these principles here.

The hiring process:
  1. 30min call with a technical recruiter
  2. 45min call with engineering lead for MLOps to discuss your past projects
  3. Take-home assignment - does not have a deadline and it is syntax agnostic
  4. 60min technical discussion
  5. 30min call with leadership 

Other important info:

  • This is a remote role from an EU country, UK or Switzerland or hybrid from one of our London, Munich, Copenhagen, or Zurich hubs. 
  • This is full-time employment only - no contractors possible - usually through OysterHR or a local entity.
  • We only sponsor visas if you are in the UK or some EU countries already. 

Top Skills

AWS
Datadog
Github Actions
Kubernetes
Node.js
Python
Temporal.Io
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
428 Employees
Year Founded: 2017

What We Do

Synthesia is the #1 rated AI video communications platform. Thousands of companies use it to create videos in 140 languages, saving up to 80% of their time and budget. 👉 Trusted by Zoom, Xerox, Teleperformance, Amazon and mor

Similar Jobs

Pfizer Logo Pfizer

Senior Data Manager, Clinical Data Management

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
2 Locations
121990 Employees

MacPaw Logo MacPaw

Director of Data & Analytics

Information Technology • Security • Software • Cybersecurity • App development • Data Privacy
Remote or Hybrid
28 Locations
550 Employees

MacPaw Logo MacPaw

Chief Revenue Officer

Information Technology • Security • Software • Cybersecurity • App development • Data Privacy
Remote or Hybrid
28 Locations
550 Employees

Pfizer Logo Pfizer

Manager, Resource Demand

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
Pylaía, GRC
121990 Employees

Similar Companies Hiring

Credal.ai Thumbnail
Software • Security • Productivity • Machine Learning • Artificial Intelligence
Brooklyn, NY
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account