Essential Duties
- Develop comprehensive solutions aligned with business objectives, considering factors like scalability, performance, security, and cost-efficiency.
- Design, administer and manage multi-tenant observability platforms to monitor and provide visibility into customer environments across cloud, hybrid, and on-premises
- Standardize and maintain observability configurations such as dashboards, alerts thresholds
- Collaborate with members of the Service Desk, NOC, and Advanced Support Team to define and support SLAs, SLOs, KPIs
- Collaborate during customer onboarding activities to define alerting, dashboards, and monitoring baselines
- Continuously improve noise reduction, event correlation, and escalation processes to drive operational efficiency
- Participate in incident investigations by leveraging observability data to perform root cause analysis and accelerate resolution
- Design and configure automated incident resolution activities using native or third-party integrations within the observability platform
- Ensure observability solutions align with compliance, security, across customer environments
- Configure integration and optimization to ITSM platform, ServiceNow
- Assist with information gathering and reporting to either clients or Client Success Managers
- Collaborate with senior engineers to escalate issues as needed
- Develop, maintain and update technical knowledge base articles
- Client interactions will be required
- Collaborate with peers across Managed Services Practice
- After hours / weekend work may be required
- Other duties as assigned and directed
Knowledge, Skills, and Abilities
- 3-5 years of experience in IT or MSP environment with a focus on monitoring tools
- Strong background in monitoring platforms, Logic Monitor or similar (SolarWinds, Science Logic, etc.)
- Experience using Event Management, Event Correlation and AIOps tools BigPanda or similar (Splunk, Edwin AI, etc.)
- Proficiency with automation and scripting, Python, Ansible, PowerShell, Go
- Knowledge of ITIL processes (incident, problem, change management) and integration with ITSM platform, ServiceNow
- Familiarity with customer reporting, SLA management, and service-level dashboards
- Strong problem-solving and troubleshooting abilities
- Excellent written and verbal communication skills
- Ability to work independently and manage multiple priorities
- Strong customer service focus and ability to work collaboratively with non-technical users
- Experience with Azure concepts, Log Analytics, Azure Monitor
- Hands-on experience with Infrastructure as Code tools like Bicep, Terraform and Ansible
- Experience navigating ticketing systems such as ServiceNow
- Experience using monitoring tools such as Logic Monitor
- Bachelor's degree or higher education diploma in Information Technology is desired
- Certifications like Azure Administrator Associate, AZ-104, ITL v4 Foundation, or any monitoring tool are highly desired
Top Skills
What We Do
In North America, NRI is a business and technology solutions consultancy. Guiding our clients from insight to execution, we design and deliver solutions that fuel growth, grow profitability, and deliver innovation with impact.
When you partner with NRI, you gain more than hands-on collaboration from our North American team—you connect to a global network of cross-functional experts working to keep you ahead of change.
We strategize and advise, build and transform, and protect and manage the critical systems and strategies that power modern business. From optimizing operations to driving seamless digital adoption, our solutions don’t just solve problems—they create lasting momentum.