Technical Product Manager - AI Cloud Observability

Posted 9 Days Ago
Be an Early Applicant
4 Locations
In-Office
Mid level
Artificial Intelligence • Information Technology • Consulting
The Role
Lead the vision, roadmap, and priorities for observability services in Nebius Cloud, managing backlogs and coordinating company-wide initiatives.
Summary Generated by Built In

Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.

Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.

The role

Nebius is looking for a Technical Product Manager – Observability to join the team. In this role, you will own the vision, roadmap, and priorities for observability services in Nebius Cloud, including monitoring, logging, tracing, alerting, and value-added services built on top of these capabilities.

Also, you will be responsible for shaping and managing backlogs for observability service teams and leading key, company-wide initiatives related to observability. This role requires strong technical depth combined with the ability to coordinate across engineering, development, product, technical support, and go-to-market teams.

Your responsibilities will include: 

  • Own and manage the product backlog for observability service teams
  • Lead and coordinate key cross-company initiatives and implementations involving observability
  • Work closely with engineering and architecture teams to define product requirements and deliver new observability features
  • Partner with product marketing and technical pre-sales/post-sales teams on technical publications, go-to-market activities, customer engagement, acquisition, and retention related to observability
  • Ensure the delivery of observability services that meet high standards for performance, scalability, reliability, and usability, including ML-focused observability scenarios.

We expect you to have: 

  • Experience designing, implementing, or operating large-scale observability platforms in senior engineering, architecture, or technical leadership roles (e.g., large enterprises, hyperscalers, cloud providers, or other advanced technology companies)
  • Hands-on familiarity with observability technologies and products such as OTLP, Prometheus, Grafana, VictoriaMetrics, and ClickHouse
  • Strong technical expertise in at least two of the following areas:
    • Logging, monitoring, and tracing ingestion, processing, and storage backends
    • Metrics trend analysis, anomaly detection, and insights extraction
    • Observability UX (GUIs, dashboards, drill-downs, alerts, insights)
    • Observability for ML workloads, including ML-specific instrumentation, tools and MLOps metrics
  • Proven track record of delivering complex technical initiatives requiring coordination across multiple teams or stakeholders
  • Technical leadership experience is a strong plus
  • Product management experience is not required, but a strong willingness to learn and grow into the role is essential.

It will be an added bonus if you have: 

  • Experience creating technical documentation, guides, tutorials, or learning materials related to observability platforms or tools
  • Willingness and ability to contribute to developer-facing documentation, best practices, and educational content for observability services.

About Nebius

Nebius AI is an AI cloud platform with one of the largest GPU capacities in Europe. Launched in November 2023, the Nebius AI platform provides high-end, training-optimized infrastructure for AI practitioners. As an NVIDIA preferred cloud service provider, Nebius AI offers a variety of NVIDIA GPUs for training and inference, as well as a set of tools for efficient multi-node training. 

Nebius AI owns a data center in Finland, built from the ground up by the company’s R&D team and showcasing our commitment to sustainability. The data center is home to ISEG, the most powerful commercially available supercomputer in Europe and the 16th most powerful globally (Top 500 list, November 2023).  

Nebius’s headquarters are in Amsterdam, Netherlands, with teams working out of R&D hubs across Europe and the Middle East. 

Nebius AI is built with the talent of more than 500 highly skilled engineers with a proven track record in developing sophisticated cloud and ML solutions and designing cutting-edge hardware. This allows all the layers of the Nebius AI cloud – from hardware to UI – to be built in-house, distictly differentiating Nebius AI from the majority of specialized clouds: Nebius customers get a true hyperscaler-cloud experience tailored for AI practitioners. We’re growing and expanding our products every day. 

What we offer 

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!

Top Skills

Clickhouse
Grafana
Otlp
Prometheus
Victoriametrics
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
473 Employees

What We Do

Cloud platform specifically designed to train AI models

Similar Jobs

Celonis Logo Celonis

Senior Threat Detection Engineer

Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
Hybrid
Prague, CZE
3000 Employees

2K Logo 2K

Lead Technical Animator

Gaming • Information Technology • Mobile • Software • Esports
Hybrid
Prague, CZE
3505 Employees

Schrödinger, Inc. Logo Schrödinger, Inc.

Account Manager

Healthtech • Machine Learning • Software • Biotech • Pharmaceutical
Hybrid
28 Locations
885 Employees

Rapid7 Logo Rapid7

Legal Counsel

Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Remote or Hybrid
Prague, CZE
2400 Employees

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account