Lead Site Reliability Engineer

Posted 6 Days Ago
Noida, Gautam Buddha Nagar, Uttar Pradesh
7+ Years Experience
Information Technology • Design
The Role
Seeking an experienced Lead Site Reliability Engineer to ensure highly reliable, scalable, performant, and fault-tolerant systems supporting critical business applications. Responsibilities include collaborating with development teams, writing automation scripts, implementing monitoring solutions, leading post-mortems and incident response, mentoring junior team members, and staying current with industry trends.
Summary Generated by Built In

Are you our “TYPE”?

 

Monotype Global 

Named "One of the Most Innovative Companies in Design" by Fast Company, Monotype brings brands to life through type and technology that consumers engage with every day.  

The company's rich legacy includes a library that can be traced back hundreds of years, featuring famed typefaces like Helvetica, Futura, Times New Roman and more.  

Monotype also provides a first-of-its-kind service that makes fonts more accessible for creative professionals to discover, license, and use in our increasingly digital world. We work with the biggest global brands, and with individual creatives, offering a wide set of solutions that make it easier for them to do what they do best: design beautiful brand experiences. 

 

 Monotype Solutions India  

Monotype Solutions India is a strategic center of excellence for Monotype and has been certified as a Great Place to Work® three years in a row. The focus of this fast-growing center spans Product Development, Product Management, Experience Design, User Research, Market Intelligence, Research in areas of Artificial Intelligence and Machine Learning, Innovation, Customer Success, Enterprise Business Solutions, and Sales.

 

Headquartered in the Boston area of the United States, with offices across four continents, Monotype is the world’s leading company in fonts. It’s a trusted partner to the world’s top brands and was named “One of the Most Innovative Companies in Design” by Fast Company.

 

We are seeking an experienced and highly skilled Site Reliability Engineer to join our team. In this role, you will work with cross-functional teams during the design and implementation phases to ensure highly reliable, scalable, performant, and fault-tolerant systems that support our critical business applications and services.

 

 What you’ll be doing: 

 

  • Collaborate closely with development teams to ensure the reliability, observability, performance, and maintainability of applications and systems. 

  • Develop and maintain sophisticated automation scripts and tools to streamline complex tasks and workflows, aiming to improve system reliability and operational efficiency. 

  • Implement advanced monitoring solutions and performance optimization techniques to ensure high system availability and responsiveness. 

  • Lead blameless post-mortems, identify root causes of incidents, and drive continuous improvement initiatives to prevent recurrence.

  • Mentor and provide technical guidance to junior team members, fostering a culture of knowledge sharing and professional growth within the team. 

  • Act as an escalation point for complex technical issues, ensuring timely resolution and effective communication with stakeholders. 

  • Provide leadership during incidents and outages, facilitating incident response efforts and coordinating cross-functional teams to restore services quickly. 

  • Stay updated with industry trends, emerging technologies, and best practices in SRE, DevOps, and cloud computing, and integrate relevant insights into the team's operations. 

  • Drive project tasks forward within a multi-disciplined team, ensuring alignment with project goals and deadlines. 

  • Prepare and present project status updates, metrics, and reports for stakeholders, including Senior Management. 

  • Facilitate functional and cross-functional discussions to resolve issues and drive decision-making, providing guidance and coaching as needed. 

  • Manage risks effectively and proactively, mitigating their impact on project deliverables and overall system stability. 

  • Interpret internal and external business challenges, recommending best practices to enhance products, processes, or services. 

 

What we’re looking for: 

 

  • Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent professional experience. 

  • 7-10 years of hands-on experience in Site Reliability Engineering, DevOps, or a similar role, demonstrating a strong focus on ensuring system reliability and driving automation initiatives. 

  • Proficiency in programming languages such as Python and Groovy, with a proven track record of developing automation scripts and tools to streamline operations.

  • Extensive expertise in Linux systems administration, along with practical experience in managing cloud computing platforms (e.g., AWS, GCP, Azure), containerization technologies (e.g., Docker, Kubernetes), and infrastructure as code principles using tools like Terraform or CloudFormation. 

  • Solid understanding of networking fundamentals, load balancing strategies, caching mechanisms, and distributed system design principles. 

  • Experience implementing and managing monitoring and observability solutions, including tools such as Datadog, Prometheus, Grafana, and the ELK stack, to ensure the health and performance of systems.

  • Strong problem-solving, analytical, and troubleshooting skills, with the ability to diagnose and resolve complex technical issues efficiently. 

  • Excellent communication and collaboration abilities, with a proven track record of effectively working in cross-functional teams and communicating technical concepts to non-technical stakeholders. 

  • Demonstrated ability to mentor and develop junior team members, fostering a culture of continuous learning and professional growth. 

  • Experience with chaos engineering principles and practices, leveraging tools like Chaos Monkey or Gremlin to proactively identify weaknesses in distributed systems. 

  • Knowledge of machine learning techniques and data analytics in the context of Site Reliability Engineering, enabling data-driven decision-making and predictive analysis. 

  • Familiarity with service mesh technologies such as Istio or Linkerd, facilitating the implementation of resilient and secure microservices architectures. 

  • Active involvement in open-source projects or participation in relevant technical communities, demonstrating a commitment to professional development and knowledge sharing. 

 

Monotype is an Equal Opportunities Employer. Qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status.

Monotype is expanding globally. Proficiency in one or more of the following languages is desirable (not mandatory) for this role: German, Japanese, French, or Spanish.

Top Skills

Java
Python
The Company
London
1,263 Employees
On-site Workplace
Year Founded: 1887
