Product Manager (Lighthouse)

Sorry, this job was removed at 07:02 p.m. (CST) on Tuesday, Jan 27, 2026
4 Locations
In-Office
Artificial Intelligence • Software
The Role
About FluidStack

At Fluidstack, we’re building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises - including Mistral, Poolside, Black Forest Labs, Meta, and more - to unlock compute at the speed of light.

We’re working with urgency to make AGI a reality. As such, our team is highly motivated and committed to delivering world-class infrastructure. We treat our customers’ outcomes as our own, taking pride in the systems we build and the trust we earn. If you’re motivated by purpose, obsessed with excellence, and ready to work very hard to accelerate the future of intelligence, join us in building what's next.

About the Role

We're looking for a Product Manager to lead Lighthouse, our MLOps and observability platform. You'll own the complete product lifecycle—from strategy and roadmap to execution and customer success.

You will work directly with our engineering and infrastructure teams as well as collaborate closely with customers to ensure that we're providing ML developers the metrics that matter. You will have the opportunity to partner with top tier AI labs to increase their utilization and performance as well as scale our infrastructure to hundreds of thousands of GPUs.

Focus
  • Building and executing on the roadmap for Lighthouse.

  • Partner with engineering to translate customer requirements into technical specifications and guide implementation.

  • Creating alerting rules for GPU cluster health, job failures, and resource bottlenecks

  • Designing dashboards for ML-specific KPIs (training loss curves, inference latency, batch processing metrics)

  • Collaborate with sales and customer success teams to drive adoption, gather feedback, and ensure customer satisfaction.

  • Engage directly with AI labs and enterprises to understand their observability challenges and shape the product roadmap accordingly.

About You
  • 3-5+ years of experience building developer tools or cloud infrastructure, ideally in the observability space.

  • Deeply experienced with the LGTM stack, Alertmanager, or proprietary observability tools like Datadog, etc.

  • Have an understanding of the metrics that matter to an AI/ML customer, including infrastructure availability, performance, and utilization, as well as application level metrics like MFU.

  • Understanding of GPU monitoring tools (DCGM, nvidia-smi, GPU exporters for Prometheus).

  • Knowledge of Infrastructure-as-Code (IaC) tools (e.g. Terraform, Pulumi) to standardize and simplify the deployment of the observability stack.

  • Comfortable writing SQL queries.

  • Understanding of SLA, SLO, frameworks and error budget management.

  • Experience with ML-specific monitoring tools (Weights & Biases, ClearML, etc.).

Salary & Benefits
  • Competitive total compensation package (salary + equity).

  • Retirement or pension plan, in line with local norms.

  • Health, dental, and vision insurance.

  • Generous PTO policy, in line with local norms.

The base salary range for this position is $200,000 - $300,000 per year, depending on experience, skills, qualifications, and location. This range represents our good faith estimate of the compensation for this role at the time of posting. Total compensation may also include equity in the form of stock options.

We are committed to pay equity and transparency.

Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Similar Jobs

Motorola Solutions Logo Motorola Solutions

Senior Software Engineer

Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
Remote or Hybrid
Texas, USA
23000 Employees
145K-160K Annually

Eve Logo Eve

Senior Customer & Social Media Marketer

Legal Tech • Software • Generative AI
Easy Apply
Remote or Hybrid
United States
87 Employees

Arm Logo Arm

Project Manager

Artificial Intelligence • Internet of Things • Semiconductor
In-Office
Austin, TX, USA
8314 Employees
140K-189K Annually

Apryse Logo Apryse

Sales Manager

Productivity • Software • App development • Automation
In-Office or Remote
2 Locations
665 Employees
200K-240K Annually
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: London
30 Employees
Year Founded: 2017

What We Do

Instantly reserve dedicated clusters of NVIDIA H200s and GB200s for any scale to supercharge your training and inference workflows.

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account