Research Scientist - Agency and Reasoning

Reposted 12 Days Ago
Be an Early Applicant
Palo Alto, CA
In-Office
Junior
Information Technology • Software
The Role
As a Research Scientist at Zyphra, you will conduct research in reinforcement learning and language model reasoning, implementing innovative ideas to enhance next-generation AI models.
Summary Generated by Built In
Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role:

As a Research Scientist, you will be a core contributor to Zyphra’s Agency and Reasoning Team. You will be involved with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at scale to our next generation of language models.

What We’re Looking For:
  • Strong research taste and intuition

  • The ability to work through a research project from conception to execution to write-up

  • Strong implementation and prototyping skillset

  • A researcher who can take an idea from conception to experimentation extremely quickly

  • The ability to work well and cooperate with others in a high-paced research setting

  • Curiosity, interest, and joy in understanding intelligence.

Qualifications:
  • Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks

  • Experience with language model supervised finetuning and preference learning methods such as DPO, simPO, etc.

  • Experience with context-length extension methods

  • A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning

  • Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation

  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)

  • Previously published machine learning research in well-respected venues

  • Highly proficient with PyTorch and Python

  • We are excited and able to rapidly learn new fields and implement new ideas

  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Why Work at Zyphra:
  • We strongly value new and crazy ideas and are very willing to bet big on new ideas

  • We move as quickly as we can; we aim to minimize the bar to impact as low as possible

  • We all enjoy what we do and love discussing AI

Benefits and Perks:
  • Comprehensive medical, dental, vision, and FSA plans

  • Competitive compensation and 401(k)

  • Relocation and immigration support on a case-by-case basis

  • On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

  • In-person team in Palo Alto, CA, with a collaborative, high-energy environment

Top Skills

Data Engineering
Dpo
Language Models
Python
PyTorch
Reinforcement Learning
Simpo
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, California
35 Employees

What We Do

Zyphra is a full stack AGI company based in Palo Alto, California.

Similar Jobs

NinjaOne Logo NinjaOne

Communications Specialist

Information Technology • Productivity • Software • Infrastructure as a Service (IaaS)
Remote or Hybrid
17 Locations
2000 Employees
80K-90K Annually

Pfizer Logo Pfizer

Scientist

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
7 Locations
121990 Employees
135K-225K Annually

Pfizer Logo Pfizer

Scientist

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
7 Locations
121990 Employees
135K-225K Annually

Cisco Meraki Logo Cisco Meraki

Product Adoption Manager - Content & Journeys (Spaces) (Hybrid)

Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Remote or Hybrid
San Francisco, CA, USA
3000 Employees
134K-255K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account