Principal Software Engineer, AI Observability & Evals Platform

Posted 3 Days Ago
Be an Early Applicant
Boston, MA, USA
In-Office
230K-270K Annually
Expert/Leader
Information Technology • Software • Database
The Role
The Principal Software Engineer will lead architectural decisions, mentor engineers, optimize system performance and reliability, and drive technical direction for LangChain's observability and evaluations platform.
Summary Generated by Built In
About Us

At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to also offer a platform for building, evaluating, deploying, and operating agents at scale.

With $125M raised at Series B from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we’re at a stage where we’re continuing to develop new products, growth is accelerating, and all team members have meaningful impact on what we build and how we work together. LangChain is a place where your contributions can shape how this technology shows up in the real world.

Today, LangChain, LangGraph, LangSmith, and Fleet are used by teams shipping real AI products across startups and large enterprises. Millions of developers trust LangChain to power AI teams at companies like Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.

About the Team

The LangSmith team owns and builds LangChain's core platform for observability, evaluation, and production reliability of AI systems. From tracing and annotation to run rules, evaluations, and beyond, they own this end-to-end. If you want to help define what great AI observability looks like at production scale, this is where that work gets done.

About the Role

We're looking for a Principal/Lead level Software Engineer to join the LangSmith team and help drive the technical direction of the platform. You'll build across the full stack from backend services and APIs to frontend product surfaces, and you'll play a central role in shaping how we build: setting engineering standards, mentoring engineers across the team, and making architectural decisions that hold up as we scale. If you're energized by both hands-on engineering and the multiplier effect of leveling up those around you, this role is built for that.

Location: This role can be based in our Boston, San Francisco, or NYC office.

What You'll Do

Drive Technical Direction
  • Lead architectural decisions across our Go, Python, and TypeScript stack, ensuring systems are performant, maintainable, and built to scale

  • Work across the full stack, owning features end-to-end from backend services and APIs through to frontend product experiences

  • Drive tracing, monitoring, and evaluation workflows at scale, with a focus on reliability and query performance across high-volume data

  • Help shape the product roadmap by partnering closely with product and design — not just executing on it

Raise the Bar for the Team
  • Set engineering standards for the team: define patterns, lead code reviews, and establish the foundations others build on

  • Mentor and grow engineers at all levels through code review, design feedback, pairing, and ongoing technical guidance

  • Drive projects from ambiguity to delivery while maintaining high engineering standards and aggressive timelines

Own Reliability and Quality
  • Troubleshoot and resolve production issues with a root-cause mindset, and implement durable fixes

  • Ensure system reliability through strong testing, monitoring, and alerting practices

  • Create and maintain technical documentation, including system design docs and API references

What You'll Bring
  • 10+ years of professional experience in backend or fullstack engineering on highly complex, production systems

  • Strong programming skills across multiple parts of the stack: backend (Python and/or Go) and frontend (TypeScript, React, or similar)

  • Demonstrated experience making and owning architectural decisions, including tradeoffs around data systems, APIs, and service reliability

  • Experience with high-throughput or mission-critical systems, and a proven ability to optimize for performance and reliability

  • Depth in operationalizing technical work — you've taken systems from prototype to production and kept them running well at scale

  • Demonstrated track record of mentoring engineers and raising the technical quality of a team, not just the codebase

  • Strong communication skills and comfort operating cross-functionally with product, design, and engineering leadership

  • Customer centricity and an ownership mentality — you care how the product lands, not just how the code reads

  • You exemplify our operating principles

Nice to Have
  • Experience with database systems (Postgres, Redis, ClickHouse) and cloud platforms (AWS, GCP, or Azure)

  • Familiarity with observability tooling, evaluation frameworks, or AI/LLM infrastructure

Salary Range: $230,000 - $270,000

Compensation Philosophy:

We offer competitive compensation that includes base salary, variable compensation for relevant roles, meaningful equity, benefits, and perks. Actual compensation and offerings will vary based on role, level, and location. Team members in the EU, UK, and APAC receive locally competitive benefits aligned with regional norms and regulations.

Benefits

Benefits include medical, dental, and vision coverage, flexible vacation, a 401(k) plan, meals on in-office days in the US and more.

Skills Required

  • 10+ years of professional experience in backend or fullstack engineering on production systems
  • Strong programming skills in Python and/or Go and TypeScript
  • Experience making and owning architectural decisions
  • Experience with high-throughput or mission-critical systems
  • Track record of mentoring engineers
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
123 Employees

What We Do

LangChain is the platform for building reliable agents. Our products power top engineering teams — from fast-growing startups like Lovable, Mercor, and Clay to global brands including AT&T, Home Depot, and Klarna. LangGraph is a low-level orchestration framework for building controllable agents and long-running workflows. It’s used in production by teams at Replit, Uber, LinkedIn, GitLab, and more. LangSmith offers unified evaluation and monitoring to help developers debug, evaluate, and improve their agents at scale. LangChain provides hundreds of integrations and composable components, making it easy to connect with the latest models, tools, and databases — with minimal engineering overhead. Together, these tools help teams build, deploy, and manage enterprise-grade agents, faster.

Similar Jobs

Cox Enterprises Logo Cox Enterprises

Search Engine Optimization Specialist

Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Remote or Hybrid
United States
50000 Employees
22-33 Hourly
Hybrid
Boston, MA, USA
130 Employees
150K-170K Annually
Hybrid
Boston, MA, USA
130 Employees
170K-200K Annually

Pfizer Logo Pfizer

Artificial Intelligence Engineer

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Hybrid
4 Locations
121990 Employees
139K-232K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account