Site Reliability Engineer

4 Locations
In-Office
Mid level
Mobile • Software
We help create meaningful, timely and effortless interactions between companies and their customers.
The Role
The Site Reliability Engineer will drive reliability initiatives, improve processes, manage incidents, develop tooling, and collaborate with teams to enhance system performance and reliability.

Working at Infobip means being part of something truly global. With 75+ offices across six continents, we’re not just building technology — we’re shaping how more than 80% of the world connects and communicates.


As employees, we take pride in contributing to the world’s largest and only full-stack cloud communication platform. But it’s not just what we do, it’s how we do it: with curiosity, passion, and a whole lot of collaboration.


We operate with an AI-first mindset, embedding intelligent tools into our daily workflows to work smarter and more efficiently. Every role here benefits from and contributes to this approach.


If you're looking for meaningful work and challenges that grow you in a culture where people show up with purpose, this is your opportunity.

Let’s build what’s next, together.

Why is this position important at Infobip? 
 

We are looking for engineers who enjoy solving problems, have a passion for quality, and are data-driven and analytical. We are hiring a Site Reliability Engineer whose primary focus is driving reliability initiatives and promoting reliability practices across the organization. For this position, we are specifically looking for an SRE with a strong development background (backend or full stack) who is comfortable with scripting and building internal tooling.
 
The SRE understands client use cases, usage, and impact; establishes and improves the incident management process; and drives post-incident reviews, focusing on providing incident (meta)data and analyzing trends. SREs also act as advisors on follow-up actions, promoting in particular a systematic approach and long-term solutions.
 

The team we are hiring for has a diverse skill set, with members coming from different backgrounds: engineering, product management, testing, and routing operations. They have acquired that skill set over very long careers, which makes them a nightmare for problems and incidents. Currently, they operate as a form of center of practice (excellence) and help Infobip resolve a diverse set of complex problems across the reliability spectrum.
 
 

SREs are: 
 

1. Owners of Incident management and its lifecycle 

  • Working on process alignment with the rest of the company 

  • Working on streamlining and process automation 

  • Driving process adoption and monitoring adoption 

  • Creating objective and actionable incident reporting (monthly, quarterly, and yearly incident reports for management; on-demand product reliability reports)

2. Advisors/Experts on reliability topics (advocating platform reliability) 

  • Providing onboarding and education on reliability topics and on how to improve reliability 

  • Driving reliability community/mindset 

  • Promoting a blameless incident culture 

  • Identifying risks and promoting a systematic approach to problems and long-term solutions

3. Helping teams define, develop, monitor and maintain SLOs/SLIs for their products
 

4. Providing a client-centric point of view on incidents and products 
 

5. Shortening incident response times (detect, engage, fix) by helping with troubleshooting and improvements 
 

6. Developing internal tooling and automation to reduce toil and speed up incident response 
 

7. Providing objective quality insights 

  • Raising awareness of areas of improvement based on historical data 

  • Providing actionable insights based on quality trends and metrics 

  • Providing guidance for outliers that are outside of the expected baseline 
     

What will the main responsibilities be 

  • Discovering problems, defining, and solving tasks under the guidance of more senior engineers 

  • Designing and implementing automation and tooling (scripts, services, dashboards) to improve incident detection and response, reduce manual work (toil), and provide better reliability insights

  • Keeping an overview of incidents in the production environment and helping others with complex incident response using incident response strategies

  • Troubleshooting platform-wide problems: understanding the big picture and guiding towards resolution of the reliability-related problem

  • Collaborating with product and development teams to integrate reliability improvements into code and architecture

  • Actively investing in learning about the development process, technologies, system architecture, platform and products, and clients' requirements in the Infobip context

  • Actively participating in incident reviews and learning from incidents

  • Communicating about problems and solutions at the right level of abstraction depending on the audience

  • Sharing knowledge with the team/requirement area

More about you: 

- High school diploma, or Bachelor's or Master's degree

- 3+ years in positions like:

  • Software Engineer/Backend Engineer/Full Stack Engineer, or

  • DevOps/SRE with strong scripting/programming experience in at least one of: Java, Go, Python, Bash/PowerShell or similar

- Plus familiarity with some of the following: 

  • Monitoring, logging and alerting tooling 

  • Network and Linux fundamentals 

  • System/architectural design and distributed systems 

  • Databases and querying languages (SQL/NoSQL) 

- Proficient in English, with a good understanding of the development process, risk analysis, and problem solving

- Focus on clients, strong teamwork skills, curiosity and eagerness to learn, technical skills, execution efficiency, continuous improvement mindset, and great analytical and communication skills 

Why you'll love it here

• Financial rewards & recognition - A fair compensation aligned with your experience, industry, and market standards, performance-driven bonuses, regular reviews to support your growth and recognize your contributions, and a culture that values your impact.
• Flexible work arrangements - We combine in-person collaboration with remote work and flexible working hours, because great ideas happen everywhere - and not always between 9 and 5.
• ESOP (Employee Stock Ownership Plan) - As an Infobip employee, you’ll have the opportunity to share in our company’s success through stock options.
• Work-life balance and well-being - We offer time off when you need it, special leave days for life’s big moments, and a flexible hybrid work model tailored to local regulations.
• Career mobility - Your career is a journey. With internal mobility, upskilling, and mentorship, we help you shape your path. 
• Professional development - Learning never stops. Onboarding, mentorship, and training programs help you grow - no matter where you start.
• International mobility - Ready to take your career global? Explore short and long-term opportunities in our Hubs worldwide. 
While some benefits may vary by location, our goal remains the same: to support your growth, well-being, and success - wherever you are.

Apply if you like this job description, and for more details - feel free to contact recruiter Zrinka on LinkedIn

Diversity drives connection

Infobip is built on diverse backgrounds, perspectives, and talents. We’re proud to be an equal-opportunity employer and are committed to fostering an inclusive workplace.

No matter your race, gender, age, background, or identity — if you have the passion and skills to thrive, there’s a place for you here.

All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender, gender identity, national origin, citizenship, disability, veteran status or any other part of one's identity.


Top Skills

Bash
Go
Java
NoSQL
PowerShell
Python
SQL

The Company
HQ: London
3,100 Employees
Year Founded: 2006

What We Do

HIRING NOW! Infobip helps businesses build connected experiences across all stages of the customer journey. Accessed through a single platform, Infobip’s omnichannel engagement, identity, user authentication and contact center solutions help businesses and partners overcome the complexity of consumer communications to grow business and increase loyalty.

We work with large organizations, including seven of the world’s 10 biggest brands, across sales and marketing, operations, human resources, IT and security, and customer service. Our mobile engagement solutions help optimize operational functions, enhance internal and external communications, improve customer experiences, reduce support costs, generate new revenue, and gain a competitive advantage.

Whether two-factor authentication for high-tech retailers, emergency alerts for global giants, or mobile-giving solutions for large charities, Infobip offers the scale, service flexibility, reliability, and heritage to provide interactive solutions for today and in the future.

Companies choose Infobip for our domain expertise, service flexibility, demonstrated performance and reliability, global scale, and corporate maturity.

Why Work With Us

We work with some of the biggest enterprises in the world to make their customers’ lives better. But we’re small enough that every person counts. We’ve got a passion for our technology to rival any start-up. Our people are the best and most professional in the world. But we’re a suits-and-bureaucracy free zone.
