Lead Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
2 Locations
In-Office or Remote
Senior level
Big Data • Cloud • Information Technology
The Role
Responsible for implementing and enhancing enterprise observability and automation platforms, ensuring optimal network performance and compliance with governance standards.
Summary Generated by Built In

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

Iron Mountain is seeking a proactive and skilled Observability Automation & Integration Lead Engineer to join our Infrastructure Transformation team.

In this role, you will be responsible for implementing, managing, and enhancing enterprise observability and automation platforms to ensure optimal network and application performance across a global ecosystem.

The Infrastructure Transformation team is a dynamic group dedicated to modernizing our technical infrastructure, driving efficiency through automation, and ensuring the continuous availability of critical systems.

What You'll Do (Responsibilities)

In this role, you will:

  • Responsibility 1: Drive Enterprise Platform Engineering - Design, implement, and maintain highly available, 24x7 continuous monitoring solutions using platforms like Datadog and SolarWinds, including configuring alerts, creating dashboards, and conducting data trend analysis.

  • Responsibility 2: Champion Automation & Integration - Collaborate with Enterprise Architects and operations teams to automate infrastructure operations, integrate monitoring data with platforms like Configuration Management Database (CMDB)/ServiceNow, and identify opportunities for proactive monitoring solutions.

  • Responsibility 3: Ensure Design and Operational Adherence - Ensure compliance with architectural governance and security standards in all designs, drive process improvements, and provide on-call support for critical issues outside of normal business hours.

What You'll Bring (Skills & Qualifications)

The ideal candidate will have:

  • 10+ years of experience in monitoring platform engineering with tools such as Datadog, SolarWinds, Prometheus, or Grafana.

  • Strong knowledge of network and application performance monitoring, including configuring monitors using protocols like Simple Network Management Protocol (SNMP), Secure Shell (SSH), Windows Remote Management (WinRM), Windows Management Instrumentation (WMI), or Java Management Extensions (JMX).

  • Proven ability in automating infrastructure operations using tools like Ansible and Python and integrating systems via Representational State Transfer (REST) Application Programming Interface (API)/scripting.

  • Bachelor's degree in Computer Science, Information Technology, or a related field.

What We Offer (Benefits)

This section lists benefits specific to the role and region. Since this information was not included in the original job description, I will include the standard Iron Mountain offerings from the template as a placeholder, which can be modified based on the role and location requirements:

  • Competitive compensation and benefits aligned with the experience.

  • Flexible work options/alternative work options to support work-life balance.

  • Comprehensive health, wellness, and retirement plans.

  • Opportunities for continuous learning and professional growth.

Call to Action

If you are passionate about building scalable, high-performance systems and enhancing enterprise observability, apply today to join the Iron Mountain Infrastructure Transformation team!

Category: Information Technology

Top Skills

Ansible
Datadog
Grafana
Jmx
Prometheus
Python
Rest Api
Snmp
Solarwinds
Ssh
Winrm
Wmi
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Boston, MA
32,000 Employees
Year Founded: 1951

What We Do

Iron Mountain Incorporated (NYSE: IRM) is the global leader for storage and information management services. Trusted by more than 220,000 organizations around the world, Iron Mountain boasts a real estate network of more than 80 million square feet across more than 1,350 facilities in 45 countries dedicated to protecting and preserving what matters most for its customers. Iron Mountain’s solutions portfolio includes records management, data management, document management, data centers, art storage and logistics, and secure shredding help organizations to lower storage costs, comply with regulations, recover from disaster, and better use their information. Founded in 1951, Iron Mountain stores and protects billions of information assets, including critical business documents, electronic information, medical data and cultural and historical artifacts.

Gallery

Gallery

Similar Jobs

MetLife Logo MetLife

Assistant Manager - Technology Services

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
India
43000 Employees

MetLife Logo MetLife

Platform Engineer

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
India
43000 Employees

GitLab Logo GitLab

Account Executive

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
Remote
India
2500 Employees

SailPoint Logo SailPoint

Senior Software Engineer

Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Remote or Hybrid
India
2461 Employees

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
LayerOne Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account