DevOps Engineer at MUFG (Dallas, TX)
Sorry, this job was removed at 4:16 p.m. (CST) on Monday, September 5, 2022
By clicking Apply Now you agree to share your profile information with the hiring company.
Your potential. Your opportunity.
Do you want your voice heard and your actions to count?
Discover your opportunity with Mitsubishi UFJ Financial Group (MUFG), the 5th largest financial group in the world. Across the globe, we're 180,000 colleagues, striving to make a difference for every client, organization, and community we serve. We stand for our values, building long-term relationships, serving society, and fostering shared and sustainable growth for a better world.
With a vision to be the world's most trusted financial group, it's part of our culture to put people first, listen to new and diverse ideas and collaborate toward greater innovation, speed and agility. This means investing in talent, technologies, and tools that empower you to own your career.
Join MUFG, where being inspired is expected and making a meaningful impact is rewarded.
Using Infrastructure-as-Code (Cloud formation, Teraform, scripting) , and Continuous Integration / Continuous Delivery Pipelines in Jenkins to handle the full cloud-native application lifecycle. Partnering with Risk Management and Security teams to identify the standards and lead the measurement of and deployment of secured and compliant cloud infrastructure and services. Measuring Service Level Indicators and Service Level Objectives for a distributed software system. Performing site reliability engineering; developing software configuration management implementations using declarative infrastructure as code (Ansible, Chef, or Puppet configuration management toolchain); and developing and managing cloud native topology structures and deploying containers to a Kubernetes cluster. Performing site reliability engineering development efforts to improve availability and performance of software systems (debugging, triaging and identifying root cause for failure in a production environment, and performing postmortem analysis). Defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems. Establishing error budget for applications and infrastructures to reduce cost of failure. Partnering with application teams to understand performance and capacity requirements of cloud-native solutions and services. Leading the establishment of SLIs and SLOs for enterprise cloud infrastructure and services; analyzing traffic and data to ensure we are using our technologies for optimal performance. Leading the proactive efforts in order to protect product availability and performance. Building and maintaining cloud operations tools for monitoring, notifications, trending, and analysis. Driving the standardization, simplification, and automation of operational workflows. Providing Level 3 support for troubleshooting and services restoration in Production.
Qualifications - External
Education: Bachelor's degree in Computer Science, Information Technology or a related field (or foreign equivalent degree). In lieu of a Bachelor's degree, employer will accept three years of site reliability engineering experience with a focus on safe and reliable software Production deployments and measuring application service performance and availability.
Experience: 7 years of DevOps experience measuring Service Level Indicators and Service Level Objectives for a distributed software system; and 3 years of experience must include Site Reliability Engineering (debugging, triaging and identifying root cause for failure in a production environment, and performing postmortem analysis; developing software configuration management implementations using declarative infrastructure as code (Ansible, Chef, or Puppet configuration management toolchain); developing and managing cloud native topology structures and deploying containers to a Kubernetes cluster; defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems; and establishing error budget for applications and infrastructures to reduce cost of failure.
Other: Required to work nights and weekends on occasion for deployments or production issues.
Other: Position may allow telecommuting up to 4 days a week.
Location: Irving, TX 75038
Reference internal requisition #10054200-WD.
We are committed to leveraging the diverse backgrounds, perspectives and experience of our workforce to create opportunities for our people and our business; Equal Opportunity Employer: Minority/Female/Disability/Veteran.
The above statements are intended to describe the general nature and level of work being performed. They are not intended to be construed as an exhaustive list of all responsibilities duties and skills required of personnel so classified.
We are proud to be an Equal Opportunity/Affirmative Action Employer and committed to leveraging the diverse backgrounds, perspectives and experience of our workforce to create opportunities for our colleagues and our business. We do not discriminate on the basis of race, color, national origin, religion, gender expression, gender identity, sex, age, ancestry, marital status, protected veteran and military status, disability, medical condition, sexual orientation, genetic information, or any other status of an individual or that individual's associates or relatives that is protected under applicable federal, state, or local law.
Do you want your voice heard and your actions to count?
Discover your opportunity with Mitsubishi UFJ Financial Group (MUFG), the 5th largest financial group in the world. Across the globe, we're 180,000 colleagues, striving to make a difference for every client, organization, and community we serve. We stand for our values, building long-term relationships, serving society, and fostering shared and sustainable growth for a better world.
With a vision to be the world's most trusted financial group, it's part of our culture to put people first, listen to new and diverse ideas and collaborate toward greater innovation, speed and agility. This means investing in talent, technologies, and tools that empower you to own your career.
Join MUFG, where being inspired is expected and making a meaningful impact is rewarded.
Using Infrastructure-as-Code (Cloud formation, Teraform, scripting) , and Continuous Integration / Continuous Delivery Pipelines in Jenkins to handle the full cloud-native application lifecycle. Partnering with Risk Management and Security teams to identify the standards and lead the measurement of and deployment of secured and compliant cloud infrastructure and services. Measuring Service Level Indicators and Service Level Objectives for a distributed software system. Performing site reliability engineering; developing software configuration management implementations using declarative infrastructure as code (Ansible, Chef, or Puppet configuration management toolchain); and developing and managing cloud native topology structures and deploying containers to a Kubernetes cluster. Performing site reliability engineering development efforts to improve availability and performance of software systems (debugging, triaging and identifying root cause for failure in a production environment, and performing postmortem analysis). Defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems. Establishing error budget for applications and infrastructures to reduce cost of failure. Partnering with application teams to understand performance and capacity requirements of cloud-native solutions and services. Leading the establishment of SLIs and SLOs for enterprise cloud infrastructure and services; analyzing traffic and data to ensure we are using our technologies for optimal performance. Leading the proactive efforts in order to protect product availability and performance. Building and maintaining cloud operations tools for monitoring, notifications, trending, and analysis. Driving the standardization, simplification, and automation of operational workflows. Providing Level 3 support for troubleshooting and services restoration in Production.
Qualifications - External
Education: Bachelor's degree in Computer Science, Information Technology or a related field (or foreign equivalent degree). In lieu of a Bachelor's degree, employer will accept three years of site reliability engineering experience with a focus on safe and reliable software Production deployments and measuring application service performance and availability.
Experience: 7 years of DevOps experience measuring Service Level Indicators and Service Level Objectives for a distributed software system; and 3 years of experience must include Site Reliability Engineering (debugging, triaging and identifying root cause for failure in a production environment, and performing postmortem analysis; developing software configuration management implementations using declarative infrastructure as code (Ansible, Chef, or Puppet configuration management toolchain); developing and managing cloud native topology structures and deploying containers to a Kubernetes cluster; defining Standard Operating Procedures (SOPs) and Runbook for troubleshooting production issues; developing software operations resilience patterns for deployed software infrastructure and implementing highly available and resilient software systems; and establishing error budget for applications and infrastructures to reduce cost of failure.
Other: Required to work nights and weekends on occasion for deployments or production issues.
Other: Position may allow telecommuting up to 4 days a week.
Location: Irving, TX 75038
Reference internal requisition #10054200-WD.
We are committed to leveraging the diverse backgrounds, perspectives and experience of our workforce to create opportunities for our people and our business; Equal Opportunity Employer: Minority/Female/Disability/Veteran.
The above statements are intended to describe the general nature and level of work being performed. They are not intended to be construed as an exhaustive list of all responsibilities duties and skills required of personnel so classified.
We are proud to be an Equal Opportunity/Affirmative Action Employer and committed to leveraging the diverse backgrounds, perspectives and experience of our workforce to create opportunities for our colleagues and our business. We do not discriminate on the basis of race, color, national origin, religion, gender expression, gender identity, sex, age, ancestry, marital status, protected veteran and military status, disability, medical condition, sexual orientation, genetic information, or any other status of an individual or that individual's associates or relatives that is protected under applicable federal, state, or local law.
More Information on MUFG
MUFG operates in the Fintech industry. The company is located in Oakland, CA, San Francisco, CA, Los Angeles, CA, Tempe, AZ, Chicago, IL, Raleigh, NC, Jersey City, NJ, New York, NY and Boston, MA. It has 30196 total employees. It offers perks and benefits such as Flexible Spending Account (FSA), Disability insurance, Dental insurance, Vision insurance, Health insurance and Life insurance. To see all jobs at MUFG, click here.
Read Full Job Description