Customer Reliability Manager

Posted 2 Hours Ago
Be an Early Applicant
3 Locations
In-Office
Senior level
Blockchain • Web3
The Role
Lead and grow a Customer Reliability Engineering team, ensuring high-quality support and reducing friction in deployments and operations across various models.
Summary Generated by Built In
About the company

Braintrust is the AI observability platform. By connecting evals and observability in one workflow, Braintrust gives builders the visibility to understand how AI behaves in production and the tools to improve it.

Teams at Notion, Stripe, Zapier, Vercel, and Ramp use Braintrust to compare models, test prompts, and catch regressions — turning production data into better AI with every release.

About the Role

At Braintrust, exceptional support is one of our most important strategic advantages. Support is part of Engineering at Braintrust and exists to help reduce friction in the deployment and operation of our product. Our customers are developers building LLM-powered applications, and they move fast. We win by helping them move faster.

We’re looking for a manager to build and lead a team of highly senior and knowledgeable Customer Reliability Engineers to provide ambitiously high quality support focused on customer infrastructure. This team is responsible for reducing friction associated with Braintrust's various deployment models (hybrid, BYOC, and SaaS Enterprise). Engineers on this team directly scope and attempt fixes for infrastructure issues, manage high-stakes customer environments, and ensure product reliability across all customer deployment types.

This role blends engineering leadership, deployment expertise, and customer experience. If you love upleveling Senior+ level talent, scaling cutting edge and complex support motions, and reducing pain for developers, we’d love to talk with you.

What You’ll Do
  • Lead and grow a team of Customer Reliability Engineers, delivering reliable, high-touch support across all Braintrust deployment models: hybrid, Bring Your Own Cloud (BYOC), and enterprise SaaS

  • Own the primary after-hours on-call rotation for customer-reported SEV1s, with backup coverage from Customer Solution Architects (CSAs) and Developer Support Engineers.

  • Run incident response and escalation, including enabling customer infrastructure teams while jumping in hands-on for the highest-severity issues.

  • Own day-to-day tickets tied to deployments, upgrades, and performance troubleshooting.

  • Triage and scope deployment-related feature requests and bug reports, attempt fixes when feasible, and route custom work to Professional Services when needed.

  • Lead new BYOC deployments and upgrades.

  • Respond to high-severity alerts for BYOC customers.

  • Validate each new data plane release against the standard hybrid deployment, and partner with Docs to ship upgrade guidance alongside the changelog.

  • Coach and mentor the team on infrastructure debugging, deployment best practices, and strong customer ownership.

  • Synthesize customer feedback and operational trends for Product and Engineering to improve reliability and reduce recurring pain points.

You Might Be a Fit If You
  • Have 5–10+ years of experience leading support for developer-facing products.

  • Deeply familiar with deploying Terraform, Helm, and Kubernetes based infrastructure across major cloud providers.

  • Are comfortable reviewing, debugging, and reasoning about backend services, infrastructure, and deployment configurations.

  • Take ownership of customer-impacting issues end-to-end, ensuring accountability, follow-through, and continuous improvement.

  • Communicate clearly and empathetically, especially when navigating ambiguity or high-stakes customer situations.

  • Are deeply curious about LLM use cases and excited to lead teams building cutting edge support systems for AI products that are measurable, reliable, and trustworthy.

Bonus Points For
  • Familiarity with OpenAI, Anthropic, or similar LLM providers at a systems or integration level.

  • Experience guiding teams working with datasets, evaluation metrics, or prompt engineering.

  • A track record of building or scaling support tooling, documentation programs, or product-led growth initiatives.

  • Experience as a senior technical leader or tech lead in a high-growth startup environment.

  • History of partnering hands on with Engineering on production fixes for backend services, SDKs, or infrastructure.

  • Experience leading support for products with self-hosted offerings (e.g., Terraform, Kubernetes) and comfort leading incident response involving customer owned containerized environments.

Benefits include
  • Medical, dental, and vision insurance

  • Daily lunch, snacks, and beverages

  • Flexible time off

  • Competitive salary and equity

  • AI Stipend

Equal opportunity

Braintrust is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
241 Employees
Year Founded: 2018

What We Do

Braintrust is the first decentralized Web3 talent network that connects skilled, vetted knowledge workers with the world’s leading companies. The community that relies on Braintrust to find work are the same people who own and build it, ensuring the network always serves the needs of its users, instead of a centrally-controlled corporation. And because the community of knowledge workers and contributors earns ownership and control of Braintrust through its native BTRST token for their contributions to the network and its growth, new Talent and jobs have participated in the network at record speeds. Braintrust has over 700,000+ community members, with knowledge workers and project contributors across the world. Braintrust is trusted by hundreds of Fortune 1000 global enterprises including Nestlé, Porsche, Atlassian, Goldman Sachs, and Nike. For more information, visit: www.braintrust.com. BTRST is available on Coinbase.com and in the Coinbase Android and iOS apps. Coinbase customers can trade, send, receive, or store BTRST in most Coinbase-supported regions. For more information on Braintrust and the BTRST token, read the “Braintrust: The Decentralized Talent Network” whitepaper.

Similar Jobs

Rapid7 Logo Rapid7

Account Executive

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
WA, USA
2400 Employees
120K-162K Annually

Snap Inc. Logo Snap Inc.

Principal Software Engineer

Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Hybrid
6 Locations
5000 Employees
235K-414K Annually

Sonar Logo Sonar

Global Cloud Alliances Leader

Artificial Intelligence • Cloud • Security • Software
Easy Apply
Remote or Hybrid
Seattle, WA, USA
800 Employees

Leader Bank Logo Leader Bank

SBL Underwriting Specialist

Fintech • Insurance • Payments • Social Impact • Financial Services
Remote or Hybrid
United States
420 Employees
90K-110K Annually

Similar Companies Hiring

Bitnomial Thumbnail
Web3 • Software • Fintech • Financial Services • Cryptocurrency • Blockchain
Chicago, IL
26 Employees
Block Thumbnail
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Oakland, CA
12000 Employees
Rain Thumbnail
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3 • Infrastructure as a Service (IaaS)
New York, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account