Site Reliability Engineer (Observability)

Posted 9 Days Ago
Hyderabad, Telangana
3-5 Years Experience
Information Technology • Business Intelligence • Consulting
The Role
The Site Reliability Engineer ensures system reliability, availability, and performance, collaborating with cross-functional teams. Responsibilities include designing scalable systems, maintaining observability solutions, analyzing performance, and mentoring junior resources.

Make an impact with NTT DATA
Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.

Your day at NTT DATA

The Site Reliability Engineer (SRE) is a seasoned subject matter expert, responsible for ensuring the reliability, availability, and performance of company systems and infrastructure.
The SRE works closely with development teams, operations teams, and other stakeholders to enhance system resiliency, automate processes, and improve overall system reliability.

What you'll be doing

Job Description

The Site Reliability Engineer (SRE) is an advanced subject matter expert, responsible for leading efforts to ensure the reliability, scalability, and performance of company systems and infrastructure. The SRE in this role is expected to be a subject matter expert (SME) for observability with Logic Monitor Full Stack Observability (FSO).
This advanced subject matter expert collaborates with cross-functional teams, provides technical guidance, and drives strategic initiatives to enhance system reliability, automate processes, and improve overall operational efficiency.
The Site Reliability Engineer coaches and provides mentoring to junior resources within the team.

Key Responsibilities

· Designs and architects resilient and scalable systems, ensuring high availability, fault tolerance, and efficient resource utilization.

· Establishes and maintains robust Observability solutions to proactively detect system issues, performance bottlenecks, and security vulnerabilities.

· Continuously analyses system performance, identifies bottlenecks, and implements optimizations to improve system scalability, responsiveness, and resource efficiency.

· Leads capacity planning efforts, analyses system resource utilization, and forecasts future needs to ensure adequate scalability and optimal resource allocation.

· Delivers implementations or custom-scoped technical solutions to Logic Monitor customers in line with customer requirements and signed statements of work (SOWs).

· As Solutions Architect, is responsible for architecting and successfully delivering Logic Monitor-based Full Stack Observability solutions.

· Duties range from crafting advanced Logic Monitor configurations and leading discovery, design, and deployment working sessions with customers to relaying product features and improvements to customers' CIO/CTO teams.

· Serves as a subject matter expert on Logic Monitor-based FSO solutions.

· Acts as the subject matter expert for CMDB integrations.

· Assists the Solution Architect, where required, in scoping CMDB integration projects.

· Guides customers on best practices and on how to leverage CMDB integrations in efficient, scalable solutions.

· Attends remote working sessions with customers to drive successful FSO adoption through discovery, design, and deployment of the NTT Managed Services Platform.

· Identifies gaps, feature requests, or issues with ongoing solutions and escalates them to the Logic Monitor (LM) product and development teams.

· Assists in developing customer-specific scripted solutions using Logic Monitor product features (Websites, Logic Modules, NetScans) and, externally, the REST API.

· Occasionally assists Monitoring Engineering with Logic Module development.

· Provides technical leadership and mentorship to junior team members.

· Fosters a collaborative and inclusive work environment, drives cross-functional initiatives, and facilitates knowledge sharing and continuous learning across the organization.

· Stays updated with industry trends, emerging technologies, and best practices to drive innovation and improve overall system performance.

Knowledge and Attributes

· Advanced technical expertise in Linux/Unix systems, networking, and system administration.

· Advanced proficiency in scripting or programming languages, such as Python, Go, Java, or Ruby.

· Advanced knowledge of cloud platforms (such as AWS, Azure, or Google Cloud) and associated services.

· Proven advanced expertise in performance monitoring, optimization, and troubleshooting using tools such as Prometheus, Grafana, or New Relic.

· Advanced expertise in incident management, root cause analysis, and post-incident reviews.

· Excellent problem-solving and analytical skills, with keen attention to detail.

· Excellent communication, collaboration, and leadership skills.

· Advanced ability to optimize system performance, scalability, and reliability. Experience with performance monitoring and tuning tools (for example, Prometheus, Grafana, or New Relic) to identify bottlenecks, analyse performance data, and implement optimization strategies.

· Advanced understanding of security principles, best practices, and compliance requirements. Experience in designing and implementing security controls, performing security assessments, and ensuring compliance with industry standards.

· Willingness to travel (20-25%).

Workplace type:

Hybrid Working

About NTT DATA
NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.

Equal Opportunity Employer
NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.

Top Skills

Logic Monitor
The Company
Brisbane
55,092 Employees
On-site Workplace

What We Do

NTT DATA, Inc. is a trusted global innovator of business and technology services. We're committed to helping clients innovate, optimize and transform for long-term success. Our R&D investments help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity.
