Sr Backend Engineer - AI

Posted Yesterday
Hiring Remotely in India
Remote
Senior level
Artificial Intelligence • Information Technology • Software • Automation
The Role
Build the AI layer for SigNoz: implement agent orchestration, long-running execution, streaming APIs (SSE), and MCP server integrations to expose observability data to LLM agents. Ship production LLM features, design scalable distributed backends in Go and async Python, and drive projects end-to-end in a remote-first environment.
Summary Generated by Built In
About SigNoz

SigNoz is an open-source observability platform that helps modern engineering teams monitor, debug, and optimize their applications with deep visibility into metrics, traces, and logs, all in one place. We're built natively on OpenTelemetry and offer both self-hosted and cloud options, so teams can run observability the way they want, without vendor lock-in.

We are growing fast and building core developer infra products. And we are not fooling around:

  • 27,000+ GitHub stars

  • 900+ customers

  • 8,000+ members in our Slack community

Role: Sr Backend Engineer - AI

We're looking for a Sr Backend Engineer for the Nerve team, our AI pod. The Nerve team builds the AI layer of SigNoz: Noz (the observability AI teammate inside SigNoz), the SigNoz MCP server, AI SRE, and LLM/GenAI observability. We're building agent-native observability at SigNoz, where the engineer and their agent are both first-class users.

What you'll work on

You will work with a high-caliber team on some of the areas below.

  • Noz, the AI teammate: agent orchestration, tool execution, and the APIs that stream results back into the product

  • SigNoz MCP server that exposes observability data to LLM agents, so any agent can query a user's logs, metrics, and traces

  • The hard parts of agent backends: long-running execution, security, streaming over SSE, tool calling, and session state across DBs

  • LLM/GenAI observability, so teams (including us) can monitor their own AI apps with OpenTelemetry

What will make you successful
  • 5-8 years building backend systems in production. You'll work across both Go and Python.

  • Strong with Go (concurrency, channels, performance) and with async Python (FastAPI, Pydantic, asyncio), and comfortable designing APIs and data models.

  • You have built and shipped LLM features in production: agents, tool or function calling, evals, MCP, context and prompt engineering. You know how to make a non-deterministic system behave like a dependable one.

  • Experience building distributed systems while writing clean and scalable code

  • Ability to drive initiatives end-to-end: from problem discovery → design → implementation → rollout

  • Comfortable working in a high-ownership, fast-moving, remote-first environment

  • Strong communication skills: you can write clear tech docs and explain trade-offs

Nice-to-haves
  • Past experience in AI or platform teams at Series B+ startups

  • Experience in observability (monitoring / logging / tracing)

  • Familiarity with OpenTelemetry and/or ClickHouse, Kafka, Kubernetes, etc.

  • Loves open source, ideally with prior contributions to OSS projects (any size)

Why you'll love working at SigNoz
  • You'll help define agent-native observability, a new category, not maintain a mature dashboard product.

  • Work on a globally used open-source project that engineers actually love

  • Collaborate with a high-caliber team that just can't stop shipping

  • Remote-first, async-friendly culture

  • Opportunity to help define the future of open-source observability

Skills Required

  • 5-8 years building backend systems in production
  • Proficiency in Go (concurrency, channels, performance)
  • Proficiency in async Python (FastAPI, Pydantic, asyncio) and API/data model design
  • Built and shipped LLM features in production (agents, tool/function calling, evals, MCP, prompt/context engineering)
  • Experience building distributed systems and writing clean, scalable code
  • Ability to drive initiatives end-to-end: problem discovery, design, implementation, rollout
  • Strong communication skills; can write clear technical documentation and explain trade-offs
  • Comfortable working in a high-ownership, fast-moving, remote-first environment
  • Past experience in AI or platform teams of Series B+ startups
  • Experience in observability (monitoring, logging, tracing)
  • Familiarity with OpenTelemetry, ClickHouse, Kafka, Kubernetes
  • Prior open-source contributions

The Company
HQ: Bengaluru
40 Employees

What We Do

Open-source observability platform
