Product Engineer

Reposted 10 Days Ago
San Francisco, CA, USA
In-Office
Junior
Artificial Intelligence • Software
The Role
Build full-stack product features for an agent behavior monitoring platform: collaborate with design, engage customers, implement front-end/back-end and database work, run experiments, and contribute to product design and demos.

Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and latency, our ABM surfaces behavioral anomalies, such as instruction drift and context retrieval loss, in scaled production environments.

Hundreds of teams building autonomous agents rely on Judgment to understand how their systems are behaving post-deployment. Instead of reactive incident triage, they cluster patterns across conversations and workflows, correlate regressions to specific interaction types, and pinpoint where reliability breaks down in their usage context.

We’ve raised $30M+ across two rounds in the past five months. Our investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, Chris Manning, Michael Ovitz, Michael Abbott, Cory Levy, Kevin Hartz, and others.

The Role:

We are looking for a Product Engineer with 2+ years of experience to join our team and build high-taste products for self-learning agents. This role is crucial for scaling our product to meet customer demand and maintaining our reputation for exceptional user experience in the rapidly evolving AI agent space. We need someone who is passionate about improving agent behavior and can contribute to a product that is both powerful and a joy to use.

What You’ll Do:
  • Collaborate with our designer to build out new features and refine existing user flows based on customer feedback.

  • Engage with customers to understand their needs, gather feedback, and pitch new features.

  • Develop core product features on both the front-end and back-end, including database work.

  • Experiment with new features and technologies, run quick demos for the team, and build out promising solutions.

  • Contribute to product design reviews to ensure a high-quality, intuitive user experience.

Why Judgment?
  • Agents can’t work without this. Today’s agents hallucinate, drift, and break in production. We’re building the infrastructure that fixes this: the monitoring layer that makes agents self-improving.

  • We’re wired to win. We're a team of fewer than 20, but we ship like a team of 50+ every day. You'll be working with Olympiad medalists, debate champions, and competitive athletes who bring that same intensity to company building.

  • Fast track to founding. Our engineers interface directly with customers, ship code into their environments, and use their feedback to dictate what’s next on the roadmap. Everyone on the team is either an ex-founder or a founder-to-be.

  • We make sure our people do their best work. If you deserve a spot on the team, money will never get in the way of it. Full benefits, Equinox, and a private chef to take care of you. We sprint hard and we play hard; ask us about our Smash/Mario Kart tournaments.
    We work in person in San Francisco.

Skills Required

  • 2+ years of experience
  • Develop core product features on front-end, back-end, and database systems
  • Collaborate closely with designers and participate in product design reviews
  • Engage with customers to gather feedback and pitch features
  • Experiment with new features and technologies and present quick demos
  • Passion for improving agent behavior and delivering excellent user experience
  • Work in person in San Francisco

The Company
HQ: San Francisco, California
20 Employees
Year Founded: 2025

What We Do

Judgment Labs builds agent behavior monitoring (ABM) infrastructure. Judgment provides a toolkit to track and judge agent behavior in online and offline setups, enabling you to convert high-signal interaction data from production/test environments into more reliable agents.
