Site Reliability Engineer - Advanced level
Objectives and Responsibilities
Technical Design, Development & Problem Solving
- Extensive experience with public cloud infrastructure (AWS and Azure) and CI/CD pipeline automation.
- Translate design requirements into engineering tasks effectively.
- Build system components, integrating appropriate technologies as needed.
- Responsible for application migrations, DevOps tools maintenance and support, container orchestration, tagging, cloud cost management, and AMI optimization.
- Work hand-in-hand with enterprise architecture, application partners, cloud product management, and cloud operations.
- Deep understanding of and experience with Infrastructure as Code (IaC) to manage cloud resources, accounts, and services.
- Experience building scalable production systems from start to finish, adhering to business needs and incorporating security, scalability, high availability, telemetry, and observability.
Critical Thinking & Problem Solving
- Awareness of cloud services and supporting integrations.
- Ability to think outside the box and help resolve critical technical issues.
- Master problem-solving techniques: identify the root cause and provide permanent, timely solutions.
- Respect for IT processes and organizational policies.
- Good stakeholder relationship management skills.
Execution & Delivery
- Be self-directed and self-motivated in following up on project tasks and maintenance.
- Work with fellow engineers, product owners, and enterprise stakeholders to help deliver innovative data integration and platform solutions.
- Provide realistic estimates for assigned tasks, with clear assumptions and acceptance criteria.
- Independently deliver assigned tasks within the projected time frame.
- Deliver with team spirit in mind (partner with team members and ensure they understand the work being done) to ensure the success of the assigned project.
- Adhere to the team's delivery methodology (Agile, Kanban, etc.) and find and recommend ways to improve efficiency.
- Look for opportunities to reduce costs, improve function, and add value.
Essential Skills:
- See the big picture from a business perspective.
- Don't fear complexity and scale.
- Have a software-centric mindset.
- Be comfortable with coding.
- Relish change and frequent releases.
- View problems as opportunities for improvement.
- Communicate with technical and non-technical stakeholders.
- Provide technical leadership for complex, long-term initiatives.
- Communicate progress and gaps to leadership.
- Provide authoritative expertise to others by advising on prioritization, planning, and execution of projects within the subdomain.
Leadership:
- Mentor core engineers.
- Partner with business stakeholders and product managers to define project timelines and deliverables at each project stage.
- Be aware of and involved in identifying the resources required for project delivery, including resources from other departments. May not take ownership of these communications, but assists the Lead when relevant.
- Lead the team on technical decisions and help drive other team members toward solutions.
- Implement best practices for continuous integration and continuous delivery frameworks.
Basic Qualifications
- Bachelor's degree in Computer Science or equivalent; 5 years of experience architecting and implementing fully automated (IaC/Terraform), secure, reliable, scalable, and resilient hybrid-cloud solutions
- Must have hands-on experience with Kubernetes and microservices architecture
- 2-3 years of experience with Terraform
- Experience with DevOps concepts and tools (containers; CI/CD with GitHub, Jenkins, Artifactory, and Helm; Chef, Ansible, Puppet, etc.) and emerging technologies
- Experience with observability tools such as Splunk, New Relic, and PagerDuty
- Experience with infrastructure systems that support enterprise data science and analytics capabilities, including streaming and real-time analytics (Kafka, Spark Streaming, and Snowplow)
- Experience with on-prem to cloud migration
- Experience with cloud security toolsets (Prisma, ZeroNorth, Wiz, JFrog Xray, CloudWatch, etc.)
- Exposure to network infrastructure (e.g., setting up and managing firewalls, WAFs, network segregation, VPNs, and network ACLs)
- Strong written and verbal communication skills
- Able to thrive in a collaborative and cross-functional environment
- AWS/Azure associate-level certification
Preferred Qualifications:
- 7+ years of experience in AWS/Azure cloud
- CKA (Certified Kubernetes Administrator) or CKAD (Certified Kubernetes Application Developer) certification
- Subject matter expert in cloud security and/or cloud networking
- AWS/Azure certification, preferably at the professional level
MassMutual is an Equal Employment Opportunity employer Minority/Female/Sexual Orientation/Gender Identity/Individual with Disability/Protected Veteran. We welcome all persons to apply. Note: Veterans are welcome to apply, regardless of their discharge status.