Model Behavior Engineer

Reposted 8 Days Ago
2 Locations
Hybrid
98K-140K Annually
Mid level
Artificial Intelligence • Productivity • Software
Notion is the AI workspace where teams and AI agents get more done together.
The Role
Owner of Notion AI quality: design and iterate prompts and context strategies, analyze production data, build evals and metrics, evaluate/launch models with labs, and drive cross-functional quality improvements.
Summary Generated by Built In
Who We Are

Notion is the collaborative AI workspace where teams and agents think together. We're building one place where your knowledge, projects, meetings, and AI tools live side by side, so work feels faster, clearer, and less fragmented. Millions of individuals, small teams, and large companies run their work on Notion.

Notinos (our employees) are customer zero in bringing this future of work to life. We care about craft, humanity, and building things that last — not just shipping the next feature, but setting a standard for how modern teams (with humans and agents working together) think and execute.

About the Role

You'll own the quality bar for Notion AI products. You’ll work with product and engineering teams to build systems to define what “good” looks like, measure our progress, and drive changes to deliver reliable and high-quality AI experiences. Your work directly shapes how Notion's AI products behave for millions of users.

This isn't a traditional software engineering role. It’s an art & science role. You won't spend your days writing code. Instead, you'll focus on understanding and shaping how our AI products behave through context engineering, designing evaluation systems, and analyzing data. This team sits in our AI engineering team, working directly with engineering, product, design, and data.

This role is a unique blend of ops, strategy, and product thinking. Day to day, you'll live in production data, ship prompt fixes, run evals and, in effect, shape our quality strategy. As part of that you'll shape Notion's model strategy and work directly with frontier AI labs (OpenAI, Anthropic, Google) to evaluate and launch new models.

We're looking for problem-seeking generalists interested in 0 → 1: curious people with high agency who thrive in ambiguous, fast-moving product areas. We're building a product, but also building a new function. You'll have real ownership from day one and help write the playbook as we scale.

What You'll Achieve
  • Context engineering — Design, test, and iterate on system prompts, tool prompts, and context strategies that shape how Notion's AI products behave. Understand the nuances of how models respond to different context structures and use that knowledge to drive quality improvements directly.

  • Understand & debug — Live in production data: transcripts, logs, user feedback. Reproduce issues, identify root causes, and translate symptoms into actionable problem statements. Find signal in noisy data.

  • Build evals & Measurement — Design eval strategies, build datasets, run evaluations. Track quality over time. Identify issues before users do. Own the loop: define quality goals, create evals, test and improve

  • Evaluate and launch new models with leading research labs — Evaluate and launch models from OpenAI, Anthropic, Google, and others. Benchmark across dimensions: quality, latency, cost, edge cases. Help shape Notion's model strategy based on real data.

  • Drive quality priorities — Work embedded with eng and product teams to surface the most important issues. Own the quality narrative: severity, frequency, what to fix and why. Be the voice of quality in the room.

  • Build tooling & systems — Help manage AI observability and eval platforms (e.g., Braintrust). Build the playbooks and tools that enable all teams at Notion to build AI products.

Skills You’ll Need to Bring
  • Driver mentality — You treat problems as yours. If something's broken, it's your job to fix it, even if you didn't cause it. You have a bias to action.

  • Curiosity -You’re excited about exploring the “jagged frontier” of LLM capabilities and how AI products meet reality

  • Analytical instinct — Your first move is to look at data. You can find signal in noise.

  • Comfortable working with data — You can self-serve insights from large datasets, whether through SQL, coding agents, or other tools.

  • Clear communication — You can explain complex issues simply.

  • Experience with LLMs, prompting, or AI products

Nice to Haves
  • Backgrounds in engineering, product, data science, research, consulting

  • You've built something on your own to solve a problem — side project, startup, tool, whatever

Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role’s scope and complexity, and the candidate’s experience and expertise, and may vary from the range provided below. For roles based in San Francisco or New York City, the estimated base salary range for this role is $98,000 - $140,000 per year.

By clicking "Submit Application", I understand and agree that Notion and its affiliates and subsidiaries will collect and process my information in accordance with Notion's Global Recruiting Privacy Policy and NYLL 144. #LI-Onsite

A Note on AI

You don’t need deep AI expertise for every role, but we do expect every Notino to be intellectually curious, drawn to tinkering and discovery, and excited to use AI as a real collaborator in their work. For some roles, AI fluency is a core requirement — when that’s the case, we’ll make it explicit in the qualifications. People who thrive here don’t treat AI as a novelty. They use it to think better, move faster, and build more creatively.

Equal Opportunity & Accommodations

We hire talented and passionate people from a variety of backgrounds because we want our teams to reflect the wide diversity of our customers. If you’re excited about a role but your experience doesn’t align perfectly with every bullet point listed, we still encourage you to apply.

Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please let your recruiter know.

Skills Required

  • Driver mentality (bias to action, ownership of problems)
  • Curiosity about LLM capabilities and AI product behavior
  • Analytical instinct; ability to find signal in noisy data
  • Comfortable working with data and self-serving insights (e.g., SQL, coding agents)
  • Clear written and verbal communication
  • Experience with LLMs, prompting, or AI products
  • Ability to design, test, and iterate system prompts and context strategies
  • Experience designing eval strategies, building datasets, and running evaluations
  • Backgrounds in engineering, product, data science, research, or consulting
  • Side projects, startups, or independently built tools demonstrating initiative

What the Team is Saying

Alma
Penny
Marlene

Notion Compensation & Benefits Highlights

  • Healthcare Strength Health coverage is described as fully employer‑paid for employees and dependents, with mental‑health support and transgender healthcare available. Additional wellness access like therapy/coaching, medication management, and services such as One Medical are highlighted.
  • Parental & Family Support Paid parental leave is offered for biological, adoptive, and foster parents, alongside employer‑sponsored fertility benefits and adoption support. A post‑parental return‑to‑work program and family‑focused perks indicate breadth of coverage.
  • Equity Value & Accessibility Equity is positioned as meaningful, with late‑stage tender offers enabling employees (and some alumni) to sell shares, increasing practical access to value. Compensation emphasizes market‑aligned equity alongside cash, reinforcing equity’s role in total rewards.

Notion Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
1,000 Employees
Year Founded: 2016

What We Do

Notion blends your everyday work tools into one. Product roadmap? Company wiki? Meeting notes? With Notion, they're all in one place, and totally customizable to meet the needs of any workflow. It's the all-in-one workspace for you, your team, and your whole company. Mission: We humans are toolmakers by nature, but most of us can't build or modify the software we use every day — arguably our most powerful tool. Here at Notion, we're on a mission to make it possible for everyone to shape the tools that shape their lives.

Why Work With Us

Here at Notion, our work shapes our culture and our culture inspires our work. We seek to hire creative toolmakers that want to be the best in their craft. If every employee is able to focus on being the best toolmaker in their craft, we'll be able to achieve our mission of enabling the world to better solve its problems.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Notion Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Employees work in-person at our offices on Mondays, Tuesdays and Thursdays. The other two days are flexible.

Typical time on-site: 3 days a week
Company Office Image
HQSan Francisco, CA
Company Office Image
Dublin, Dublin
Company Office Image
Hanyang, KR
Company Office Image
Hyderabad, Hyderabad
Company Office Image
New York, NY
Company Office Image
Tokyo, Tokyo
Learn more

Similar Jobs

Notion Logo Notion

Forward Deployed Engineer, GTM

Artificial Intelligence • Productivity • Software
Hybrid
2 Locations
1000 Employees

Notion Logo Notion

Designer

Artificial Intelligence • Productivity • Software
Hybrid
New York, NY, USA
1000 Employees
204K-228K Annually

Notion Logo Notion

Head of Support, AMER & EMEA

Artificial Intelligence • Productivity • Software
Hybrid
2 Locations
1000 Employees
220K-260K Annually

Notion Logo Notion

Consultant

Artificial Intelligence • Productivity • Software
Hybrid
2 Locations
1000 Employees
175K-225K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account