Machine Learning Engineer, Training Infrastructure

Reposted 4 Days Ago
San Francisco, CA
In-Office
Mid level
Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
A generative media company building the AI-native creation platform around the world's first omnimodal foundation model.
The Role
The ML Engineer will manage the infrastructure for training ML models, optimize performance, and ensure scalability for large datasets in collaboration with research teams.
Summary Generated by Built In

About Hedra

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content creation and build a generational company together. We value startup energy, initiative, and the ability to turn bold ideas into real products. Our team is fully in-person in SF/NY with a shared love for whiteboard problem-solving.

Overview

We are looking for an ML Engineer with 3+ YOE in high-performance computing systems to manage and optimize our computational infrastructure for training and deploying our machine learning models. The ideal candidate has diverse experience managing ML workloads at scale, supporting our 3DVAE and video diffusion models. We encourage you to apply even if you don't meet every requirement — we value curiosity, creativity, and the drive to solve hard problems.

Responsibilities

  • Design, implement, and maintain scalable computing solutions for training and deploying ML models, ensuring infrastructure can handle large video datasets.

  • Manage and optimize the performance of our computing clusters or cloud instances, such as AWS or Google Cloud, to support distributed training.

  • Ensure that our infrastructure can handle the resource-intensive tasks associated with training large generative models.

  • Monitor system performance and implement improvements to maximize efficiency and utilization, using tools like Airflow for orchestration.

  • Collaborate across research teams to understand their computational needs and provide appropriate solutions, facilitating seamless model deployment.

Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, with a focus on system administration.

  • Experience with cloud computing platforms such as Amazon Web Services, Google Cloud, or Microsoft Azure, essential for managing large-scale ML workloads.

  • Values engineering processes and version control (CI/CD).

  • Knowledge of containerization technologies like Docker and Kubernetes required for deployments at scale.

  • Understanding of distributed training techniques and how to scale models across multi-node clusters aligning with video generation needs.

  • Strong problem-solving and communication skills, given the need to collaborate with diverse teams.

This role is vital for ensuring the computational backbone supports the company’s ML efforts, focusing on deployment and scalability.

Benefits

  • Competitive compensation + equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

Top Skills

Airflow
AWS
Docker
GCP
High-Performance Computing
Kubernetes
Machine Learning
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
14 Employees
Year Founded: 2023

What We Do

Hedra is an AI native platform for multimodal creation. The platform is built around their own cutting-edge proprietary video model, Character-3, which is the first multimodal model in production. Alongside Character-3, the platform also brings other leading foundation models into one ecosystem spanning generative images, video, and audio. Prosumer and enterprise users leverage Hedra to generate content ranging from viral social media to branded content marketing.

Why Work With Us

We're an early-stage team that moves very fast and is building at the leading edge of AI/Media. Every employee takes on a lot of ownership and has an opportunity to learn and grow rapidly.

Gallery

Gallery

Hedra Offices

OnSite Workspace

Hedra's main office is in San Francisco and secondary hub is in New York.

Typical time on-site: None
HQHQ
New York, New York
Learn more

Similar Jobs

Hedra Logo Hedra

Back-end Engineer

Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
In-Office
2 Locations
14 Employees
175K-275K Annually

Hedra Logo Hedra

Front-end Engineer

Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
In-Office
2 Locations
14 Employees
175K-275K Annually

Hedra Logo Hedra

Machine Learning Engineer

Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
In-Office
San Francisco, CA, USA
14 Employees

Hedra Logo Hedra

Scientist

Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
In-Office
San Francisco, CA, USA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account