Research Scientist (post-training)

Reposted 5 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
Expert/Leader
Artificial Intelligence • Generative AI
The Role
Seeking a Research Scientist to lead initiatives on alignment and post-training of video generation models, focusing on quality, safety, and human alignment.
Summary Generated by Built In

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.

Role overview:

We are seeking an exceptional Research Scientist to join our team, focusing on alignment and post-training techniques for large-scale video generation models. In this role, you will be at the forefront of ensuring our diffusion-based video models reliably produce high-quality, physically accurate and safe outputs that match human preferences and values.

Key responsibilities:
  • Lead research initiatives in alignment and post-training methods for video generation models, focusing on improved quality, reliability, and adherence to human intent

  • Design and implement supervised fine-tuning and reinforcement learning from human feedback (RLHF) pipelines for video generation models

  • Develop robust evaluation frameworks to measure model alignment, safety, and output quality

  • Create and optimize data collection pipelines for human feedback and preferences

  • Design and conduct experiments to validate alignment techniques and their scaling properties

  • Collaborate with cross-functional teams to integrate alignment improvements into our production pipeline

  • Stay at the cutting edge of the field by regularly reviewing academic literature in both generative AI and alignment

  • Mentor junior researchers and foster a culture of responsible AI development

  • Work closely with product teams to ensure alignment methods enhance rather than inhibit model capabilities

Qualifications:
  • Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related field

  • Must have:

    • Strong publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR) with a focus on reinforcement learning, alignment, or generative models

    • Extensive experience implementing and optimizing large-scale training pipelines using PyTorch

    • Deep understanding of reinforcement learning techniques, particularly RLHF

    • Experience with distributed training systems and large-scale experiments

    • Proven track record in designing and implementing robust evaluation frameworks

    • Excellent communication skills with the ability to explain complex technical concepts to diverse audiences

    • Strong software engineering skills and experience with complex shared codebases

  • Ideal candidate will have:

    • Experience with diffusion models or other generative architectures

    • Background in fine-tuning large language models or generative models

    • Experience working with human feedback data collection and annotation pipelines

    • Strong aesthetic sense and understanding of video quality assessment

    • Familiarity with alignment techniques such as constitutional AI or debate

    • Track record of successful collaboration with product teams

    • Experience with perceptual quality metrics and human evaluation design

    • Contributions to open-source projects in AI alignment or generative AI

      Additional Information

    The role is based in the Bay Area (San Francisco). Candidates are expected to be located near the Bay Area or open to relocation.

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

Top Skills

Human Feedback
Python
PyTorch
Reinforcement Learning
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
50 Employees

What We Do

Enabling the next billion AI video creators with Genmo

Similar Jobs

Verkada Inc Logo Verkada Inc

Enterprise Account Executive

Cloud • Hardware • Security • Software
In-Office or Remote
4 Locations
2000 Employees
220K-280K Annually

Square Logo Square

Account Executive

eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
12000 Employees
130K-234K Annually

Cash App Logo Cash App

Operations Manager

Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Remote or Hybrid
8 Locations
3500 Employees
153K-270K Annually

Block Logo Block

Sales Manager

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
In-Office or Remote
8 Locations
12000 Employees
115K-208K Annually

Similar Companies Hiring

Granted Thumbnail
Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account