Senior Systems Engineer (Site Reliability Engineering)

Posted 4 Days Ago
Be an Early Applicant
Īnd, Chamba, Himāchal Pradesh
Expert/Leader
Big Data • Cloud • Information Technology
The Role
Provides technical support for users of computer applications and hardware, collaborates with network services and application development, maintains troubleshooting tracking log, manages escalations and triage with technical teams, and supports applications built on various platforms.
Summary Generated by Built In

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

Provides technical support for users of computer applications and hardware (e.g., PCs, servers, mainframes). Answers questions regarding system procedures, online transactions, systems status and downtime procedures. Collaborates with network services, software systems engineering and/or application development in order to restore service and/or identify problems. Maintains a troubleshooting tracking log ensuring timely resolution of problems.

Required skills and Experience:

  • Records Center Applications and Data Management Applications that deliver records and data management capabilities such as storing, archiving, shredding, asset transfer, permanent withdrawal, etc.

  • Managing the Observability strategy sitting with Engineering/Development and SRE teams to improve availability, performance and reliability of the applications.

  • Application Sustainment Management & Global Service Delivery

  • Managing escalations from customers, Customer Care, Global Account Management and handling triage with technical teams

  • Qualifying new work orders from customers that request Account Consolidation and Single Sign-On capabilities to improve the revenue from the sustainment services.

  • SRE Management (Site Reliability Engineering)

  • As Application SRE Engr, focus on the exception handler streamlining to build and support the log based metrics definition.

  • Understanding of the Datadog Log mgmt and Datadog alert mgmt features to define Log based metrics, alerts and dashboards.

  • Support of Applications built with Google Cloud logging, Identity & Access mgmt, Cloud network and Project.

  • Understanding of the Gitlab code repository mgmt, Roles, Projects, Groups, merge request, container registry management, reporting DevOps metrics and analytics

  • Managing the Data Center Applications built on Linux/Windows, Apache/Tomcat & Java

  • Bachelors of Science in Computer Science & Engineering (4 years degree) and 5+ years working experience.

  • Scrum Master/PMP Certification / Agile SAFe certification

Qualifications:

  • Minimum 5 years SRE Engineer experience

  • Build and Manage procedures for Cloud/Web/4GL/DataCenter Application Systems optimization, as performance improvement, and workflow design (or redesign).

  • Improving the reliability, performance and availability of the applications. Responsible for leading the technical strategy for our underpinning infrastructure, alerting & monitoring and incident resolution to optimize the MTTR targets.

  • Accountable for the performance and results of multiple applications

  • Works on issues where analysis of situations or data requires an in-depth knowledge of organizational objectives and processes.

  • Experience in managing & supporting monthly/quarterly/annual Billing Cycles that are critical for the companys financial health.

  • SRE Engineer supporting Cloud native Applications in Google Cloud Platform (GCP) with prior experience in:

  • Building Automation Services and Instrumentation for Iron Mountain's Observability Program

  • Implementing Application Reliability strategy for Iron Mountain Warehouse Applications

  • Defining the process of governance with the selected feature set from the platform used for Observability

  • Experience as SRE Engineer in defining the log based metrics, monitors, thresholds for defining Error Budget, Service Level Objectives (SLO), SLI and creating event dashboards for cloud native Iron Mountain.

  • Build procedures for Infrastructure optimization, as performance improvement, and workflow design (or redesign).

  • Improving the reliability, performance and availability of the applications. Responsible for leading the technical strategy for our underpinning infrastructure, alerting & monitoring and incident resolution to optimize the MTTR targets.


Preferred skills:

  • Past experience in Software AppDev in Golang and/or Java/.Net

  • Secret management platform Thycotic/Delinea Secret Server, AKeyelss and/or Thycotic DevOps Secret vault

  • Gitlab Agile - epics, features, stories, product management support, program management, boards, reporting metrics, KPI.

  • Experience as SRE Engineer in defining the log based metrics, monitors, thresholds for defining Error Budget, Service Level Objectives (SLO), SLI and creating event dashboards for Iron Mountain.

  • Experience in Google Cloud Functions, Workflow, Google Kubernetes Engine (GKE)

Category: Information Technology

Top Skills

Java
The Company
HQ: Boston, MA
32,000 Employees
Hybrid Workplace
Year Founded: 1951

What We Do

Iron Mountain Incorporated (NYSE: IRM) is the global leader for storage and information management services. Trusted by more than 220,000 organizations around the world, Iron Mountain boasts a real estate network of more than 80 million square feet across more than 1,350 facilities in 45 countries dedicated to protecting and preserving what matters most for its customers. Iron Mountain’s solutions portfolio includes records management, data management, document management, data centers, art storage and logistics, and secure shredding help organizations to lower storage costs, comply with regulations, recover from disaster, and better use their information. Founded in 1951, Iron Mountain stores and protects billions of information assets, including critical business documents, electronic information, medical data and cultural and historical artifacts.

Gallery

Gallery

Jobs at Similar Companies

Silverfort Logo Silverfort

Commercial Sales Manager- East

Information Technology • Sales • Security • Cybersecurity • Automation
Remote
8 Locations
357 Employees

Jobba Trade Technologies, Inc. Logo Jobba Trade Technologies, Inc.

Senior Back End Developer

Cloud • Information Technology • Productivity • Professional Services • Software
Remote
Hybrid
Chicago, IL, USA
45 Employees

InCommodities Logo InCommodities

Head of People & Culture - NA

Information Technology • Machine Learning • Analytics • Energy • Automation • Renewable Energy
Hybrid
Austin, TX, USA
234 Employees

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account