Research Scientist Intern - Post-Training (RLHF)

France
Internship
Artificial Intelligence • Marketing Tech • Software • Generative AI • Automation
Jasper elevates all marketing and all marketers with the power of AI
The Role
As a Research Scientist Intern, you will advance image-generation techniques using RLHF methods, participate in training models, and contribute to open-source projects while collaborating with a research team.
Summary Generated by Built In

Jasper is the leading AI marketing platform, enabling the world's most innovative companies to reimagine their end-to-end marketing workflows and drive higher ROI through increased brand consistency, efficiency, and personalization at scale.

Jasper has been recognized as "one of the Top 15 Most Innovative AI Companies of 2024" by Fast Company and is trusted by nearly 20% of the Fortune 500 – including Prudential, Ulta Beauty, and Wayfair. Founded in 2021, Jasper is a remote-first organization with team members across the US, France, and Australia.

About The Role

Jasper Research is seeking a highly motivated intern to advance the frontiers of open-source image-generation and image-editing applications. In this role, you will be instrumental in developing new state-of-the-art open-source text-to-image and image-editing models while collaborating closely with our talented team of researchers and engineers. The internship duration is 6 months.

We have historically relied on existing third-party open-source image foundation models for our core applications (replace-background, image uncropping, image relighting, …), building expertise in model fine-tuning methods. One of the team’s current core projects is building our own open-weights text-to-image foundation model from scratch.

This role is open to candidates located in Paris. It will be a hybrid setup, which requires you to come into the office when necessary.

What you will do at Jasper

As a Research Scientist Intern, you will work closely with our research team to design the best-suited RLHF techniques for diffusion models. This role offers a unique opportunity to contribute to the development of a foundational open-source model, addressing challenges in scalability, fidelity, and generalization. You will engage in both theoretical and applied research, collaborating with experts in machine learning, computer vision, and natural language processing. The main goals of this internship are:

  1. Research & Development: Conduct literature reviews, then propose and implement innovative methods for fine-tuning text-to-image models with reinforcement learning and preference-optimization methods (RLHF, DPO).

  2. Model Training & Evaluation: Participate in the fine-tuning stages of large-scale text-to-image model training, conduct rigorous ablations, design evaluation metrics, and analyze model performance.

  3. Documentation & Communication: Document research findings, prepare technical reports, and participate in the external communication of the results.

  4. Open Source & Community: Contribute to an ambitious open-source project, publish research findings, and engage with the broader AI community.

What you will bring to Jasper
  • Currently enrolled in a Ph.D. or M.Sc. program in machine learning, applied mathematics, or computer science (Ph.D. preferred).

  • Experience applying RLHF techniques to either large language models or diffusion models.

  • A genuine interest in the field and a strong motivation to contribute to open-source initiatives, ideally demonstrated through personal projects or previous experience in deep learning, especially generative models (e.g., diffusion models, GANs, VAEs, transformers).

  • Strong coding abilities in Python and deep learning frameworks (PyTorch, TensorFlow, JAX).

  • Strong critical-thinking and problem-solving skills, coupled with excellent teamwork skills.

  • Availability for a period of 6 months.

Nice to have

  • Strong Python programming skills, including software engineering best practices for producing high-quality code.

  • Experience with distributed training and large-scale systems on GPU clusters.

  • Experience with large-scale data processing.

  • Contributions to open-source projects.

  • Proven track record of achieving significant results, as demonstrated by first-authored publications in major conferences and journals such as CVPR, ECCV, ICCV, ICLR, and NeurIPS.

Benefits & Perks
  • Flexible, hybrid work environment. Our office is based at Station F in Paris, the vibrant hub of the French startup ecosystem. Our efficient and lean team at Station F thrives on innovation and collaboration.

  • Competitive compensation package.

Top Skills

JAX
Python
PyTorch
TensorFlow
The Company
HQ: Boise, ID
220 Employees
Year Founded: 2020

What We Do

Jasper is the industry's first intelligent, intuitive workspace for marketers. Powered by intelligent Agents, Jasper helps redefine marketing processes while providing a rich context layer, ensuring content is resonant and compliant – even at scale.

Why Work With Us

Jasper is a remote-first company where you can innovate, learn, and grow in the fast-moving world of AI. As part of our startup team, you’ll make real impact, shape the future of our products, and advance your career in a transformative industry. We value curiosity, collaboration, and bold ideas, making Jasper the place to do meaningful work in AI.

Jasper Offices

Remote Workspace

Employees work remotely.

Typical time on-site: None
United States
Australia
France