Research Scientist (Singapore)

Posted 19 Days Ago
Singapore, SGP
In-Office
Expert/Leader
Artificial Intelligence • Software
The Role
As a Research Scientist, you will lead research on video generation models, build data systems, and collaborate on model improvements, while driving project ideation and validation.
Summary Generated by Built In

About Cantina:

Cantina Labs is a social AI company, developing a suite of advanced real-time models that push the boundaries of expression, personality, and realism. We bring characters to life, transforming how people tell stories, connect, and create. We build and power ecosystems. Cantina, our flagship social AI platform, is just the beginning.

About the Role:

Cantina is expanding, and we're looking for a Research Scientist to join our growing Singapore team! In this role, you will drive foundational research on video generation models, owning the full research cycle with an emphasis on post-training. You'll also collaborate closely with data, infrastructure, and adjacent modeling teams to translate research findings into durable model improvements.

What You’ll Do:

  • Build and maintain scalable systems for ingesting, preprocessing, and delivering large-scale video data for model training

  • Design and scale distributed data pipelines for preprocessing, dataset generation, and repeated dataset refreshes

  • Own workflow orchestration, job scheduling, monitoring, and failure recovery for large-scale data processing jobs

  • Implement and maintain containerized pipeline infrastructure using Kubernetes or equivalent orchestration systems

  • Optimize cloud-based data storage and movement across providers (AWS, GCP, or Azure) for cost, throughput, and operational efficiency

  • Define and implement best practices for dataset storage layout, versioning, caching, retention, and access patterns

  • Build tooling to support deduplication workflows at scale, including near-dedup pipelines over large video corpora

  • Research and develop distillation methods for large-scale diffusion and flow-based video generation models, including guidance distillation and adversarial distillation, with a focus on preserving or improving generation quality while reducing inference cost

  • Develop reward models and preference-based fine-tuning pipelines that align video generation quality with human judgments across dimensions such as aesthetics, motion quality, and prompt adherence

  • Analyze the relationship between base model behavior and post-training outcomes, and work with the foundation model team to inform pretraining decisions accordingly

What You’ll Bring:

  • Strong hands-on experience building or scaling large-scale data systems or pipelines for machine learning workflows

  • Experience with distributed data processing frameworks such as PySpark or Ray, and orchestration tools such as Airflow or equivalent

  • Familiarity with containerization and container orchestration, including Docker and Kubernetes

  • Experience working with cloud-based data storage and compute (AWS, GCP, and/or Azure), including tradeoffs around cost, throughput, storage layout, and access patterns

  • Familiarity with video and media processing tools such as FFmpeg, PyAV, DALI, or OpenCV

  • Familiarity with multimodal or media data, including video, image, text, and audio

  • Strong research background in post-training methods for large-scale diffusion or flow-based generative models, with deep hands-on experience in distillation across both inference efficiency and quality preservation

  • Experience with reward modeling or preference-based fine-tuning for generative models, including RLHF, DPO, or equivalent alignment approaches

  • Solid understanding of the interplay between pretraining and post-training, and how base model properties affect distillation and fine-tuning outcomes

  • Proficiency in Python and modern machine learning frameworks, with a strong preference for PyTorch or JAX

  • Track record of independent research, with the ability to drive projects from initial idea through experimental validation

  • Publications at top-tier venues (NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV) preferred

  • Good understanding of the practical challenges involved in building reliable, scalable, and reproducible data workflows for machine learning systems

Benefits We Offer:

  • Competitive salary and generous company equity

  • Personal time off and paid holidays

  • Health insurance

  • Global travel insurance: Covers you when traveling internationally

  • Monthly spending stipend: $500 (~S$635)

  • Equipment: All equipment needed for your home office

Skills Required

  • Strong hands-on experience building or scaling large-scale data systems
  • Experience with distributed data processing frameworks such as PySpark or Ray
  • Experience with cloud-based data storage and compute
  • Familiarity with media processing tools such as FFmpeg or OpenCV
  • Proficiency in Python and modern machine learning frameworks
  • Track record of independent research with publications at top-tier venues

The Company
HQ: San Francisco, California
364 Employees
Year Founded: 2023

What We Do

Cantina Labs, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offers creators a way to share infinitely scalable and personalized content experiences, combined with seamless group chat across voice, video, and text.
