Title: Specialist, DevOps & SRE (Full-stack Engineer)
Synopsis: Designs and develops internet-scale applications, supporting them throughout their lifecycle while contributing to DevOps practices.
Responsibilities:
- Developing and maintaining top-notch software applications built on private & public cloud.
- Develop and maintain applications using Spring Boot framework.
- Raising the bar on application & infrastructure code quality.
- Strengthening the application and platform Security by adopting security best practices.
- Ensuring the Compatibility and Accessibility conformity along with various Non-Functional Requirements throughout the application lifecycle.
- Automating functional & other quality tests at different layers of the application to uncover issues as early as possible.
- Improving application's ability to continuously integrate & deploy to facilitate frequent and smaller releases with no customer impact.
- Baking Observability into the applications, actively monitoring and resolving issues promptly.
- Monitoring various performance indicators of the applications and making continuous improvements and optimizations.
- Assist in maintaining and enhancing CI/CD pipelines (e.g., Jenkins, GitHub Actions, GitLab CI).
- Help manage and monitor cloud infrastructure (e.g., AWS, Azure, GCP).
- Own infrastructure as code (IaC) using tools like Terraform, CloudFormation, or ARM
- Write scripts to automate routine tasks (using Bash, Python, or similar).
- Implement and manage containerization and orchestration platforms (Docker, Kubernetes, ECS).
- Monitor system performance, application logs, and deployment workflows.
- Troubleshoot complex issues and lead incident response and root-cause analysis.
- Lead vulnerability remediation and security incident support.
- Document DevOps processes and standard operating procedures.
- Establish and improve SRE practices such as SLIs, SLOs, error budgets, and capacity planning.
- Continuously optimize cost, performance, and reliability (FinOps mindset).
- Select, evaluate, and standardize DevOps tools and platforms.
Requirements:
- A bachelor's degree in Software Engineering, Computing or a related field.
- Minimum of 5 years of experience in Software Engineering and DevOps.
- An up-to-date knowledge on industry trends & new developments in programming, application development and lifecycle management.
- A passion for solving problems with innovative & practical solutions.
- Strong analytical and reasoning skills with the ability to visualize the outcomes.
- Thorough knowledge on algorithms & data structures.
- Proficiency in at least two high level programming languages such as Java, C#, Python and the ability to quickly adopt languages & techniques.
- Strong knowledge of Java and the Spring framework.
- Experience in designing RESTful APIs.
- Experience with databases such as PostgreSQL and MongoDB.
- Exposure to any cloud platform (AWS, GCP, Azure)
- Ability to craft quality code and automate various types of tests on application components.
- Proficiency in monitoring applications, troubleshooting and fixing application issues.
- Thorough knowledge on the Non-Functional requirements of applications such as Security, Accessibility, Compatibility, Observability & Availability.
- Working knowledge on continuous integration and deployment.
Skills Required
- Bachelor's degree in Software Engineering, Computing or a related field
- Minimum of 5 years of experience in Software Engineering and DevOps
- Proficiency in at least two high-level programming languages (e.g., Java, C#, Python)
- Strong knowledge of Java and the Spring framework
- Experience designing RESTful APIs
- Experience with databases such as PostgreSQL and MongoDB
- Exposure to cloud platforms (AWS, GCP, Azure)
- Experience with CI/CD tools and pipelines (Jenkins, GitHub Actions, GitLab CI)
- Experience with Infrastructure as Code (Terraform, CloudFormation, ARM)
- Experience with containerization and orchestration (Docker, Kubernetes, ECS)
- Scripting experience to automate tasks (Bash, Python or similar)
- Thorough knowledge of algorithms and data structures
- Knowledge of non-functional requirements: Security, Accessibility, Observability, Availability
- Ability to craft quality code and automate tests across application layers
- Experience monitoring, troubleshooting, incident response, and root-cause analysis
- Experience establishing SRE practices (SLIs, SLOs, error budgets, capacity planning)
What We Do
We are the world’s learning company with more than 22,500 employees operating in 70 countries. We provide content, assessment and digital services to learners, educational institutions, employers, governments and other partners globally. We are committed to helping equip learners with the skills they need to enhance their employability prospects and to succeed in the changing world of work. We believe that wherever learning flourishes so do people.







