Senior Software Engineer, Observability

Posted 3 Days Ago
Hiring Remotely in United States
Remote
130K-170K Annually
Senior level
Artificial Intelligence • Information Technology • Consulting
The Role
Design, build, and maintain backend systems for metrics and monitoring, collaborate across teams, and debug production incidents.

Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.

Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.

The Role

Nebius is hiring a Senior Software Engineer to design, build, and own the backend systems that power metrics and monitoring for large-scale infrastructure, and to develop a comprehensive infrastructure maintenance platform. This role requires strong production experience, sound system design judgment, and the ability to operate and improve critical services.

Your responsibilities will include:

  • Design and build services and agents that provide deep visibility into large-scale server fleets and data center engineering systems
  • Evolve metrics, aggregation, and alerting pipelines, with a focus on signal quality and reliability
  • Design and operate maintenance and remediation systems that enable safe, predictable fleet-wide changes and keep infrastructure healthy
  • Investigate production incidents hands-on, including on-host Linux debugging, and drive root-cause fixes
  • Collaborate closely with hardware, networking, and data center operations teams to improve reliability

What we expect you to have:

  • 5+ years of professional software engineering experience
  • Strong production experience with Python and Go, or the ability to ramp up quickly
  • Solid Linux fundamentals and comfort debugging live systems
  • Ability to write reliable, maintainable code and dig into complex, ambiguous problems
  • Experience building and operating production systems at scale

It will be an added bonus if you have:

  • Ubuntu experience, including internal tooling and packaging workflows (e.g., building Debian packages)
  • CCNA (Cisco Certified Network Associate) or equivalent networking experience

Key employee benefits: 

  • Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families. 
  • 401(k) plan: up to 4% company match with immediate vesting. 
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers. 
  • Remote work reimbursement: up to $85/month for mobile and internet. 
  • Disability & life insurance: company-paid short-term, long-term and life insurance coverage. 

Compensation

  • We offer competitive salaries, ranging from $130k to $170k base, plus quarterly performance bonuses.

Join Nebius Today!

What we offer 

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

We’re growing and expanding our products every day. If you’re up to the challenge and as excited about AI and ML as we are, join us!

Top Skills

CCNA
Debian Packaging
Go
Linux
Python
Ubuntu

The Company
473 Employees

What We Do

Cloud platform specifically designed to train AI models
