Product Manager

Posted 17 Hours Ago
7 Locations
Remote or Hybrid
Senior level
Artificial Intelligence • Information Technology • Software
The Role
As a Product Manager at Arena Intelligence, you'll lead the evaluations platform, transforming AI research into trusted product infrastructure, while collaborating across functional areas and directly engaging with users.
Summary Generated by Built In
About Arena Intelligence

Arena Intelligence is the open platform for evaluating how AI models perform in the real world. Created by researchers from UC Berkeley’s SkyLab, our mission is to measure and advance the frontier of AI for real-world use.

Millions of people use Arena Intelligence each month to explore how frontier systems perform — and we use our community’s feedback to build transparent, rigorous, and human-centered model evaluations. Leading enterprises and AI labs rely on our evaluations to understand real-world reliability, alignment, and impact. Our leaderboards are the gold standard for AI performance — trusted by leaders across the AI community and shaping the global conversation on model reliability and progress.

We’re a team of researchers, engineers, academics, and builders from places like UC Berkeley, Google, Stanford, DeepMind, and Discord. We seek truth, move fast, and value craftsmanship, curiosity, and impact over hierarchy. We’re building a company where thoughtful, curious people from all backgrounds can do their best work. Everyone on our team is a deep expert in their field — our office radiates excellence, energy, and focus.

About the Role


Arena is hiring a Product Manager to lead our evaluations platform.
Evaluations sit at the center of Arena. Our leaderboards and evaluation systems are increasingly used by frontier labs, developers, researchers, and enterprises as signals for model quality, capability, and trust.


The core challenge of this role is not traditional roadmap management. It is defining how fast-moving AI research becomes trusted product infrastructure.
You will operate at the intersection of ML research, engineering, design, and product execution — translating emerging evaluation methodologies into systems and experiences that scale to millions of users and influence how the broader ecosystem interprets AI performance.
This is a high-ownership role in an environment where evaluation methodologies, model capabilities, and ecosystem expectations evolve quickly. Success requires strong systems thinking, technical depth, product judgment, and the ability to operate effectively in ambiguity.

You'll

  • Own the roadmap and product strategy for Arena's evaluations and leaderboard platform.

  • Partner closely with ML researchers to translate emerging evaluation methodologies — multimodal evals, agentic workflows, reasoning traces, and new benchmark categories — into production-quality product experiences.

  • Define how evaluation research moves from prototype → implementation → launch → ecosystem adoption.

  • Drive cross-functional execution across research, engineering, design, and marketing to close the gap between research artifacts and trusted user-facing infrastructure.

  • Prioritize what gets evaluated next based on frontier model trends, developer demand, ecosystem gaps, and strategic opportunities.

  • Build systems, workflows, and operational rigor around evaluation quality, release cadence, and leaderboard credibility.

  • Own product metrics across adoption, engagement, citations, frontier-lab participation, and evaluation throughput.

  • Engage directly with frontier labs, researchers, developers, and enterprise users to identify where current evaluation systems break down and where the ecosystem is headed next.

  • Help shape how Arena balances evaluation rigor, usability, neutrality, and speed as the platform scales.

You'll Have

  • 5–8 years of product management experience in highly technical or ambiguous environments.

  • Strong familiarity with modern AI systems, including LLMs, multimodal models, agents, reasoning systems, and evaluation methodologies.

  • A track record of shipping technically complex products from concept to production.

  • Experience translating research-heavy or technically ambiguous work into clear product direction and execution.

  • Strong systems thinking — you can identify bottlenecks, coordination gaps, and scaling constraints across technical and organizational systems.

  • Exceptional cross-functional leadership skills. You can align researchers, engineers, and designers without relying on formal authority.

  • High agency and strong product judgment. You move quickly, make decisions with incomplete information, and create structure where little exists.

  • Strong written communication. You can write specifications for researchers and product narratives for external technical audiences with equal clarity.

Bonus

  • Technical background in computer science, machine learning, or related fields.

  • Prior experience in evaluations, benchmarking systems, AI infrastructure, research tooling, or developer platforms.

  • Experience building products for technical audiences such as researchers, ML engineers, or developers.

  • Founder or early-stage startup experience.

What we offer
  • We offer competitive compensation and equity aligned to the markets where our team members are based. The base salary range will depend on the candidate’s permanent work location.

  • Comprehensive health and wellness benefits, including medical, dental, vision, and additional support programs.

  • The opportunity to work on cutting-edge AI with a small, mission-driven team.

  • A culture that values transparency, trust, and community impact.

Come help build the space where anyone can explore and help shape the future of AI.

Arena Intelligence provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, genetics, sexual orientation, gender identity, or gender expression. We are committed to a diverse and inclusive workforce and welcome people from all backgrounds, experiences, perspectives, and abilities.


The Company
HQ: San Francisco, California
58 Employees
Year Founded: 2025

What We Do

Created by researchers from UC Berkeley, Arena (formerly LMArena) is a community-powered platform for understanding AI performance in the real world. Tens of millions of builders, researchers, and creative professionals come to Arena to use frontier models and give feedback on their responses, shaping a public leaderboard grounded in real-world use.

