AI Behavior Engineer

Reposted Yesterday
San Francisco, CA, USA
In-Office
310K-500K Annually
Mid level
Artificial Intelligence • Software
The Role
The AI Behavior Engineer will measure and shape AI model behaviors, collaborating with stakeholders to conduct evaluations and adapt methods for various contexts.
Summary Generated by Built In
Salary range: $310,000 - $500,000/year + benefits

Description: Transluce is a fast-moving nonprofit research lab building the public tech stack for scalable AI evaluation and oversight. We specialize in behavioral evaluations of frontier AI systems, assessing how models actually behave in deployment, not just how they perform on benchmarks. We are an independent nonprofit with a mission to steer the development of AI for the public good.

About the role: We're looking for an engineer to work on measuring and shaping AI model behaviors, someone who thrives on turning hard questions into evidence fast. Think of this as a forward-deployed engineering role working directly with policymakers, civil society partners, and frontier labs to rapidly answer key questions about why AI systems act the way they do, and when and why they fail. 

You'll build relationships with external domain experts, adapt our methods to new contexts, and help ensure our work is both technically credible and immediately useful to the people making consequential AI governance decisions.

This is a high-autonomy role with direct exposure to senior stakeholders and a clear line of sight from your work to real-world impact.

Core responsibility: Build and extend Transluce’s AI evaluation methods for measuring important evolving AI model behaviors. This includes:
  • Scope, prototype, and run behavioral evaluations in response to emerging policy and oversight needs, including rapid-turnaround work for government and civil society partners.
  • Execute on Transluce's contracts with government evaluators, including building evaluations for harmful manipulation with the EU AI Office.
  • Design and run privileged-access evaluations and external oversight exercises with frontier labs.
  • Work with civil society organizations and domain experts to adapt our behavioral evaluation pipelines to their contexts (e.g., mental health, persuasion, evaluation awareness).

Qualities of a strong candidate:
  • Hands-on experience designing and running AI evaluations, particularly behavioral or interactive evaluations (multi-turn, agentic, or red-teaming contexts).
  • Strong engineering instincts and good judgment about when "good enough to ship" is actually good enough.
  • Experience in customer-facing, consulting, or forward-deployed roles translating ambiguous stakeholder needs into concrete deliverables.
  • Experience running evaluations at scale or in a production context.
  • Ability to understand and balance the needs of AI researchers and domain experts, as well as those of researchers and senior decision makers.
  • Strong communication skills, low ego, openness to giving and receiving feedback.

We are located in San Francisco and are enthusiastic about working together in person. We are open to sponsoring international visas.

Skills Required

  • Experience designing and running AI evaluations, especially behavioral or interactive evaluations
  • Strong engineering instincts and judgment for deliverables
  • Experience in customer-facing, consulting, or forward-deployed roles
  • Experience running evaluations at scale or in a production context
  • Ability to balance needs of researchers and domain experts
  • Strong communication skills and openness to feedback

The Company
HQ: San Francisco, California
20 Employees
Year Founded: 2024

What We Do

Transluce is an independent research lab that builds open, scalable technology for understanding AI systems and steering them in the public interest. Transluce means to shine light through something to reveal its structure. Today's complex AI systems are difficult to understand; not even experts can reliably predict their behavior once deployed. Given AI's extraordinary consequences for society, we need scalable and open analyses of the capabilities and risks of AI systems. We are building open-source, AI-driven tools to understand and analyze AI systems. We will apply these tools to open-weight models, so the world can vet our analyses and improve their reliability. Once our technology has been vetted, we will work with frontier AI labs and governments to ensure that internal assessments reach the same standards as our publicly vetted procedures. Email: [email protected]
