Software Engineer

Reposted 19 Days Ago
San Francisco, CA, USA
In-Office
Junior
Artificial Intelligence • Information Technology • Software
The Role
As a Software Engineer, you will design evaluation scenarios, influence product development, and contribute to an early-stage startup's engineering team.
Summary Generated by Built In

About Mechanize

Mechanize builds reinforcement learning environments that frontier AI labs use to train and evaluate their coding models. Learn more at mechanize.work.

Why the work matters

AI models have gotten good at narrow coding tasks but still fail at the complex, judgment-heavy parts of software engineering. We build the environments that expose those failures and help models improve.

What you'll do

You'll design, build, and quality-assure RL tasks. Each task is a self-contained software engineering challenge with a prompt, an environment, and an automated grader. You own the full lifecycle: ideation, grading infrastructure, running frontier models against the task, failure analysis, and iteration. At this level, we expect you to consistently produce tasks that target meaningful capability gaps in frontier models, and to develop a strong sense for what makes a task informative versus merely difficult.

You will use coding agents heavily, and a large part of the job is directing them well, evaluating their output, and knowing when they are failing in subtle ways. You may also contribute to shared infrastructure: improving our build pipeline, automating parts of QA, or building tooling for other engineers.

What makes someone good at this

This work calls for strong technical fundamentals combined with a well-calibrated intuition for AI model behavior. You need to anticipate where a model will take shortcuts, distinguish genuine capability gaps from grader issues, and understand how a model will interpret a prompt. At this level, we expect extensive familiarity with what frontier coding agents can and can't do.

Good fit if you:

  • Can code in Python

  • Are confident working independently at a consistent pace

  • Have developed an intuition for what coding agents can and can't do

No prior ML or AI experience is required.

Probably not a good fit if you:

  • Want a product engineering role building features for end users

  • Prefer a highly collaborative team environment with shared ownership

  • Want extensive structured mentorship

This is independent, high-ownership work. You own your tasks from start to finish, with regular check-ins and feedback. Strong performers are recognized and promoted quickly. Benefits include health, dental, vision, and life insurance. Applying takes less than one minute.

Interview process: https://www.mechanize.work/how-our-interview-process-works

Learn more about the work: https://www.mechanize.work/what-working-here-is-like

About Mechanize: a ~20-person team in San Francisco. Backed by Patrick Collison, Nat Friedman, Daniel Gross, Jeff Dean, Dwarkesh Patel, and Sholto Douglas. Featured in the New York Times, the Dwarkesh Podcast, and Hard Fork.

Top Skills

Python
React
TypeScript
The Company
5 Employees

What We Do

We are a software company that builds RL environments and sells them to the leading AI labs.
