Research Scientist (diffusion)

Reposted 4 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
Expert/Leader
Artificial Intelligence • Generative AI
The Role
Lead research on advanced diffusion models for text-to-video. Develop algorithms, validate ideas, collaborate on integrating breakthroughs, and publish results.
Summary Generated by Built In

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.

Role overview:

We are seeking an exceptional Research Scientist to join our team, focusing on developing cutting-edge diffusion models for text-to-video generation. In this role, you will be at the forefront of innovation, creating novel architectures and algorithms that transform written descriptions into stunning, coherent video content.

Key responsibilities:
  • Lead research initiatives in advanced diffusion models for text-to-video generation, focusing on improving visual quality, temporal consistency, and semantic fidelity

  • Develop and implement state-of-the-art algorithms for translating textual descriptions into dynamic video content

  • Design and conduct rigorous experiments to validate new ideas and evaluate model performance

  • Collaborate with cross-functional teams to integrate research breakthroughs into our production pipeline

  • Stay at the cutting edge of the field by regularly reviewing academic literature and attending top-tier conferences

  • Contribute to the research community through high-quality publications and open-source contributions

  • Mentor junior researchers and foster a culture of innovation within the research team

  • Work closely with product teams to align research directions with user needs and market opportunities

Qualifications:
  • Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related field

  • Must have:

    • Strong publication record in top-tier conferences (e.g., CVPR, ICCV, NeurIPS, ICML) with a focus on generative models, particularly diffusion models

    • Extensive experience implementing and optimizing large-scale generative models for image or video tasks

    • Deep understanding of state-of-the-art techniques in text-to-image and text-to-video generation

    • Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow

    • Excellent communication skills with the ability to explain complex technical concepts to diverse audiences

    • Proven ability to work collaboratively in a team environment

  • Ideal candidate will have:

    • Postdoctoral or industrial research experience in generative AI for video

    • Hands-on experience with text-to-video generation projects

    • Expertise in other generative model architectures (e.g., GANs, VAEs) and their applications to video

    • Experience working with large-scale datasets and distributed computing environments

    • Track record of successful collaboration with product teams on technology transfers

    • Familiarity with video codecs, compression techniques, and perceptual quality metrics

    • Contributions to open-source projects in the field of generative AI

Additional information

The role is based in the Bay Area (San Francisco). Candidates are expected to be located near the Bay Area or open to relocation.

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

Top Skills

Python
PyTorch
TensorFlow
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
50 Employees

What We Do

Enabling the next billion AI video creators with Genmo

Similar Jobs

CrowdStrike Logo CrowdStrike

Sales Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
7 Locations
10000 Employees
65K-90K Annually
Remote or Hybrid
9 Locations
213000 Employees
27-41 Hourly

DraftKings Logo DraftKings

New Business Executive, Market Expansion

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
California, USA
6400 Employees
105K-105K Annually

Navixus (Formerly Eventus Solutions Group) Logo Navixus (Formerly Eventus Solutions Group)

Research Associate

Artificial Intelligence • Natural Language Processing • Professional Services • Analytics • Consulting • Conversational AI • Generative AI
Remote or Hybrid
8 Locations
830 Employees
115K-130K Annually

Similar Companies Hiring

Granted Thumbnail
Insurance • Healthtech • Financial Services • Artificial Intelligence
New York, New York
23 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account