Engineering Director (Infra Hygiene/ Observability) - Cloud Operations (Phoenix, AZ)

| Phoenix – Mesa – Scottsdale, AZ
Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.
You Lead the Way. We've Got Your Back. At American Express, we know that with the right backing, people and businesses have the power to progress in incredible ways. Whether we're supporting our customers' financial confidence to move ahead, taking commerce to new heights, or encouraging people to explore the world, our colleagues are constantly redefining what's possible - and we're proud to back each other every step of the way. When you join #TeamAmex, you become part of a diverse community of over 60,000 colleagues, all with a common goal to deliver an exceptional customer experience every day.

American Express is in an exciting cloud transformation journey driven by an energetic team of high performers. This group is nimble and creative with the power to shape our technology and product roadmap. If you have the talent and desire to deliver innovative solutions for cloud operations to support our customers seamlessly across physical, digital, mobile, and social media, join our transformation team!

Do you want to operate and run a highly available, global scale enterprise-wide IaaS cloud platform on Linux and Windows platforms? Then you should consider this innovative and creative opportunity where you can be a key transformative contributor in Cloud Operations (PaaS/IaaS) for American Express.

You will lead the operations team and contribute to building processes and tools that allow monitoring, diagnosing, and debugging of the platform services and its hosted applications.

You will architect, design, and implement improvements in process and tools to improve global cloud stability and resilience. You will contribute to scalable, secure, highly available infrastructure on a variety of platforms such as Redhat Linux, Microsoft Windows, Cohesity, VMWare and other related technologies and get exposure to various cutting-edge tools and technologies.

Technical Skills
  • 10+ years of combined infrastructure engineering/software engineering/site operations
  • Deep understanding of cloud computing technologies including bare metal, business drivers, and emerging computing trends
  • Experience with Lead/contribute to engineering efforts from design to implementation, solving complex technical challenges around monitoring distributed systems at scale.
  • Experience with driving best practices in monitoring, alerting, and performance.
  • Hands-on & practical experience of log aggregation related to Cloud Platforms, server-less compute, and micro-services (Docker)
  • Leading Infrastructure projects that improve observability tools and platforms: metrics, dashboarding, alerting, logging, application performance management, blackbox monitoring and distributed tracing
  • Experience with monitoring tools like Prometheus, Grafana, Splunk, Dynatrace, Nagios, PagerDuty, AWS Cloudwatch, Elastic Tools: Beats, Logstash, Elasticsearch, Kibana, etc.
  • Experience with configuration management and automation technologies such as Ansible, Puppet, Shell Scripting, Python, Golang, Jenkins, GitHub, and PowerShell
  • Basic understanding of infrastructure design including on-prem and public cloud, networking, virtualization, security, load balancers, caching, web servers and storage
  • Good understanding of Windows, Linux, VMware platforms
  • Experience with time series databases
  • Strong trouble-shooting skills across a broad and diverse population and environment

Non-Technical Skills
  • Familiar with agile or other rapid application development methods
  • Ability to effectively interpret technical and business objectives and to articulate solutions
  • Ability to think abstractly and deal with problems
  • Ability to enable business capabilities through innovation
  • Looks proactively beyond the obvious for continuous improvement opportunities
  • Demonstrated willingness to learn new technologies, and takes pride in how fast they develop working software
  • Driving decisions collaboratively, resolving conflicts and ensuring follow through
  • Owns people/process/tools of cloud operations and SRE program.
  • Provides continuous support for ongoing application availability.
  • Participates in cloud scale, architecture, design, and maintenance
  • Hire and retain talent. Build, manage and lead a team of highly talented engineers
  • Keen interest in delivering a great developer experience (DX)
  • Develops deep understanding of tie-ins with other systems and platforms
  • Identifies opportunities to adopt innovative technologies
  • Works closely with infrastructure, product, engineering, and solutions on feature sets that impact multiple platforms and products
  • Demonstrate ability to proactively look for process improvement opportunities, challenge conventional practices, and adopt new methods and best practices
  • Demonstrate verbal and written communication skills; ability to communicate with all levels of the organization, clearly and concisely present issues, alternatives, and recommendation(s)
  • Demonstrate understanding of priorities and effective work procedures, self-manage work time and prioritize multiple tasks and problems


As part of our diverse tech team, you can architect, code and ship software that makes us an essential part of our customers' digital lives. Here, you can work alongside talented engineers in an open, supportive, inclusive environment where your voice is valued, and you make your own decisions on what tech to use to solve challenging problems. Amex offers a range of opportunities to work with the latest technologies and encourages you to back the broader engineering community through open source. And because we understand the importance of keeping your skills fresh and relevant, we give you dedicated time to invest in your professional development. Find your place in technology on #TeamAmex

Educational Qualifications
  • Bachelor's in CS or related field. Master's degree in computer science, computer engineering, or other technical discipline, or equivalent work experience, is preferred


We back our colleagues with the support they need to thrive, professionally and personally. That's why we have Amex Flex, our enterprise working model that provides greater flexibility to colleagues while ensuring we preserve the important aspects of our unique in-person culture. Depending on role and business needs, colleagues will either work onsite, in a hybrid model (combination of in-office and virtual days) or fully virtually.

If the role you are applying for is designated as hybrid or onsite, you will be required to demonstrate that you have completed your primary COVID-19 vaccination series (i.e., 2 doses for Moderna/Pfizer and 1 dose for J&J) in order to work in or visit any of our offices. This requirement is subject to legally required accommodations.

Employment eligibility to work with American Express in the U.S. is required as the company will not pursue visa sponsorship for these positions.

American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability status, age, or any other status protected by law.
More Information on American Express
American Express operates in the Fintech industry. The company is located in New York, NY and New York, NY. American Express was founded in 2022. It has 73317 total employees. It offers perks and benefits such as Flexible Spending Account (FSA), Disability Insurance, Dental Benefits, Vision Benefits, Health Insurance Benefits and Life Insurance. To see all 228 open jobs at American Express, click here.
Read Full Job Description
Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.

Similar Jobs

Apply Now
By clicking continue you agree to Built In’s Privacy Policy and Terms of Use.
Save jobView American Express's full profileFind similar jobs