Senior Infrastructure Support Engineer

Posted 9 Days Ago
Be an Early Applicant
Cluj
Senior level
Software
Why does Thoughtworks exist? To create an extraordinary impact on the world through our culture & technology excellence.
The Role
The Senior Infrastructure Support Engineer ensures the operational efficiency and performance of cloud-based infrastructure, addressing incidents and automating processes for continuous improvement. Responsibilities include monitoring operations, collaborating with teams using various tools, executing infrastructure changes, and preparing incident reports. The role requires experience with CI/CD tools, Infrastructure as Code, observability tools, and cloud environments.
Summary Generated by Built In

As a senior Infrastructure Support Engineer, you play a vital role in maintaining technical excellence and operational efficiency, with a primary focus on cloud environments. You'll help clients through the transition to agile, value-focused practices, emphasizing shared responsibility and continuous improvement. You will monitor infrastructure performance, respond to incidents promptly and maintain resources in line with modern standards, incorporating sustainable practices.

Job responsibilities

  • You will keep a vigilant eye on the operations of shipped products and services following the agreed upon “Eyes on glass/Follow the sun” engagement models.
  • You will monitor product/service operations against key performance indicators defined by the business and take necessary actions in response to detected deviations.
  • You will define and document the appropriate responses to various kinds of incident scenarios in collaboration with the Service Reliability Engineering (SRE) team and client stakeholders, and prepare runbooks; you will reduce the human effort in day-to-day operations by automating operations, using the latest tech stacks befitting the task and improving the overall efficiency of the entire team as time progresses; working on a two-week sprint backlog together in an agile DevOps team (remote only).
  • Collaboration is increasingly carried out with the tools Mattermost, MS-Teams, GitHub and Confluence.
  • Independent processing of firewall changes in ServiceNow and Git issues incl. connectivity check, integration/network troubleshooting within the Azure Cloud and Onprem data centers.
  • The cloud product to be supported, which offers on-prem integration for over 50 applications in the cloud, is implemented and further developed by the entire DevOps team using Terraform, Bash and Powershell scripts.
  • Preparation of the Ops meeting protocol using the capacity & availability dashboards and monitoring alerts in Azure.
  • There are regular coordination meetings within the sprint in which the entire DevOps team participates: Weekly Review Meeting 1h, Bi-Weekly Ops Meeting 1h, Bi-Weekly Sprint Planning Meeting 1-2h.
  • You will prepare incident Root cause analysis (RCA) and postmortem reports, explaining analyses and outlining preventive measures to clients; Collaborating with SRE, development teams or independently, your role is to ensure clear communication and proactive steps for future incident prevention.
  • You will implement service/product reliability improvement in collaboration with service reliability engineers by writing infrastructure/observability configuration code.

Job qualifications

Technical Skills
  • You have hands-on experience in using any CI/CD tools such as Jenkins, CircleCI or Gitlab for executing deployments.
  • You have knowledge of Infrastructure as Code (IAC) tech stacks such as Terraform, Ansible, ARM or Cloudformation to provision and manage infrastructure.
  • You have working experience in using observability tools for logging, monitoring, tracing and alerting, e.g.: Datadog/Prometheus/Grafana, ELK/EFK/Splunk (Datadog is a plus).
  • You have experience with Azure.
  • You have hands-on experience executing most common operations in managing workloads on any container ecosystem tech stacks. e.g.: Docker, Kubernetes, Openshift, etc.
  • You understand system performance tuning and scaling to handle common heavy load scenarios along with concepts of highly available systems and basics of disaster recovery solutions, and are familiar with failover, backup and recovery concepts.
  • You have experience operating a Linux OS such as RHEL or a Debian-Based OS and are familiar with most common Linux OS operations and commands, reading and tweaking Bash scripts and managing runtime environment configurations such as Env Vars, Logs, etc.
  • You have experience supporting backend storage solutions such as SQL and NoSQL databases, e.g.: Postgres and MongoDB, and caching solutions such as Redis and Memcached.
  • You have experience in networking configuration and security, and are familiar with common networking setup and security practices, e.g.: loading, balancing, proxies, transport layer security (TLS) and certificate management, and an understanding of standard network protocols and configurations.
  • You have a good understanding of fundamental concepts of APIs such as request, response, headers, authentication, JSON payloads, etc.

Professional Skills

  • You have strong communication and articulation skills, are proficient in English and able to confidently hold a Q&A discussion with senior stakeholders.
  • You have people skills with an emphasis on close collaboration with multiple, cross-functional teams from the client side or Thoughtworks.
  • You have the ability to work under pressure with composure during production incidents.
  • You have strong analysis, deduction and reasoning skills, with the ability to identify patterns in data and draw conclusions.
  • You have strong drive and ownership to sign up and deliver work when called upon without being too concerned with role boundaries.
  • You are willing to be part of a rotation- and need-based 24x7 available team.

Other things to know

Learning & Development

There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys.

About Thoughtworks

Thoughtworks is a global technology consultancy that integrates strategy, design and engineering to drive digital innovation. For 30+ years, our clients have trusted our autonomous teams to build solutions that look past the obvious. Here, computer science grads come together with seasoned technologists, self-taught developers, midlife career changers and more to learn from and challenge each other. Career journeys flourish with the strength of our cultivation culture, which has won numerous awards around the world.
Join Thoughtworks and thrive. Together, our extra curiosity, innovation, passion and dedication overcomes ordinary.

#LI-Onsite

Top Skills

Terraform,Bash,Powershell,Jenkins,Circleci,Gitlab,Ansible,Arm,Cloudformation,Datadog,Prometheus,Grafana,Elk,Efk,Splunk
The Company
HQ: Chicago, IL
7,674 Employees
Hybrid Workplace
Year Founded: 1993

What We Do

We are a leading global technology consultancy that integrates strategy, design and software engineering to enable enterprises and technology disruptors across the globe to thrive as modern digital businesses.

Why Work With Us

As technologists, we have a unique role to play in how technology should benefit all of society, pursuing a more equitable future. Part of that role is to continuously educate ourselves on the issues that matter to the causes we believe in. We recognize our privilege and strive to see the world from the perspective of the most vulnerable.

Gallery

Gallery

Similar Jobs

Cluj, ROU
7674 Employees

Grubhub Logo Grubhub

Senior Software Engineer- Backend

eCommerce • Food • Sales • Software
Cluj, ROU
10000 Employees

Grubhub Logo Grubhub

Senior Software Engineer- Backend

eCommerce • Food • Sales • Software
Hybrid
Cluj, ROU
10000 Employees

Grubhub Logo Grubhub

Senior Web Engineer

eCommerce • Food • Sales • Software
Hybrid
Cluj, ROU
10000 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account