Site Reliability Engineer
Our Company
Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
The Opportunity
Adobe's Commerce Customer Engineering team is looking for a Site Reliability Engineer to join our Cloud Operations team.
As a high-growth, e-commerce company, we are looking for an experienced SRE with a heavy emphasis on backend operations support. We define, create, and operate Adobe's e-commerce platform that includes client experience, build and deploy analysis, production applications, and infrastructure.
What you'll do
- Work on the reliability and performance of the database infrastructure components that allow Adobe Commerce Cloud to scale.
- Solve complicated issues with live cloud server environments and provide software configuration and tuning recommendations for optimal performance.
- Take ownership of, and provide feedback on, any infrastructure changes related to databases, including version upgrades.
- Write software layers, scripts, deployment frameworks, tracers, monitors, and self-healing/auto-remediation tools, and automate processes.
- Analyze SQL queries and provide query optimization recommendations and best practices.
- Collaborate with engineers to diagnose and troubleshoot database errors.
- Participate in RCAs and take on action items to mitigate issues.
- Participate in a shared on-call schedule (follow-the-sun model) managed across SRE/DBRE and Engineering.
- Bring excellent social skills to your interactions with internal and external developers, the community, and merchants.
What you need to succeed
- Bachelor's degree in Computer Science or a similar field.
- 3+ years' experience working on PaaS/SaaS platforms handling high-transaction environments that require critical 24x7 uptime.
- Experience with sophisticated database migrations into cloud-native databases (AWS Aurora preferred).
- Fluency in transactional NoSQL databases: Elasticsearch, Redis.
- Experience with the LAMP architecture stack, plus proven experience with highly scalable container-based environments (Kubernetes, Docker Swarm, ...).
- Confirmed experience with high-level languages: Bash, Python, Ruby, Perl, or similar.
- Experience working with hyperscale cloud providers (AWS, Azure, GCP, etc.).
- Experience in securing, monitoring, capacity planning, foolproof DR, and backup and recovery for distributed database systems.
- Extensive experience with Monitoring / Logging / Alerting systems.
- Intellectual curiosity to pursue the unknown and to continuously learn.
- Great communication, interpersonal, and teamwork skills to work with both internal teams and our customers.
- Good social skills and a desire to work in a dynamic and fast-paced environment.
- Experience using or implementing CI/CD frameworks such as Jenkins, Travis, or GitLab CI. Infrastructure as Code (IaC) is also relevant in our stack, so real hands-on experience with Chef, Puppet, Ansible, Terraform, etc. is also welcome!
- Willingness to participate in an on-call pager rotation.
Bonus points for the following:
- AWS/Azure Certification
- Linux Certification
- Kubernetes Certification
- MySQL Certification
- Other database experience or certifications (SQL or NoSQL)
- Experience in Lean, Six Sigma, or Kaizen
- ITIL certification