Machine Learning Engineer (CUDA)

Posted 13 Hours Ago
2 Locations
Mid level
Consumer Web • Digital Media • Enterprise Web • Marketing Tech • News + Entertainment • Software • Generative AI
A generative media company building the AI-native creation platform around the world's first omnimodal foundation model.
The Role
As a CUDA ML Engineer at Hedra, you will optimize machine learning models for GPU performance, specifically focusing on 3DVAE and video diffusion models. Responsibilities include developing algorithms for efficient GPU computation, collaborating with research and engineering teams, and staying updated on GPU technology advancements to enhance model efficiency.
Summary Generated by Built In

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter who is driven to build impactful products that change the status quo. You must be willing to work in person in either NYC or SF.

Overview:

We are seeking a talented CUDA ML Engineer to optimize our machine learning models for high-performance computing on GPU hardware. The ideal candidate will have expertise in CUDA programming and a deep understanding of how to leverage GPU acceleration to maximize the efficiency of our 3DVAE and video diffusion models.

Responsibilities:

  • Optimize machine learning models, specifically 3DVAE and video diffusion models, for GPU performance using CUDA, ensuring efficient training and inference.

  • Develop and implement efficient algorithms and data structures for GPU computation, addressing performance bottlenecks in video generation tasks.

  • Work closely with the research and engineering teams to understand model requirements and performance bottlenecks, facilitating collaboration.

  • Stay current with the latest advancements in GPU technology and machine learning optimization techniques.

  • Ensure that our models run efficiently on various GPU architectures, supporting scalability for large-scale training.

Qualifications:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field, with a focus on high-performance computing.

  • Strong programming skills in C++ and CUDA, essential for GPU optimization.

  • Experience with deep learning frameworks that support GPU acceleration, such as PyTorch or TensorFlow, crucial for model implementation.

  • Understanding of parallel computing concepts and GPU architecture, given the need to optimize for hardware constraints.

  • Familiarity with machine learning models, particularly generative models, to align optimizations with model needs.

  • Excellent problem-solving and debugging skills, necessary for addressing performance issues.

Benefits:

  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

Top Skills

C++
CUDA
PyTorch
TensorFlow

The Company
HQ: San Francisco, CA
14 Employees
On-site Workplace
Year Founded: 2023

What We Do

Hedra is an AI-native platform for multimodal creation. The platform is built around its own cutting-edge proprietary video model, Character-3, which is the first omnimodal model in production. Alongside Character-3, the platform also brings other leading foundation models into one ecosystem spanning generative images, video, and audio. Prosumer and enterprise users use Hedra to generate content ranging from viral social media to branded content marketing.

Why Work With Us

We're an early-stage team that moves very fast and is building at the leading edge of AI/Media. Every employee takes on a lot of ownership and has an opportunity to learn and grow rapidly.

Hedra Offices

On-site Workspace

Hedra's main office is in San Francisco, and its secondary hub is in New York.

Typical time on-site: None
