DevOps Developer

Praha, Hlavní město Praha
In-Office
Mid level
Artificial Intelligence • Big Data • Information Technology • Security • Software
The Role
The Site Reliability Engineer will enhance observability across AWS and GCP, maintaining frameworks, managing Datadog metrics, and supporting incident readiness.
Summary Generated by Built In
Location: Praha, Czechia

Thales people architect identity management and data protection solutions at the heart of digital security. Businesses and governments rely on us to bring trust to the billions of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more. More than 30,000 organizations already rely on us to verify the identities of people and things, grant access to digital services, analyze vast quantities of information and encrypt data to make the connected world more secure.

Thales in the Czech Republic employs over 400 people of 45 different nationalities. A total of 15 teams work on projects for government agencies, banking, mobile services and the Internet of Things (IoT). At the core of our business is the development of software which we configure and embed in a multitude of different devices and form factors. These include many kinds of payment cards, SIM cards, travel passes, secure eBanking devices, authentication tokens, machine identification modules (MIM), and secure ID documents including ePassports, eID and eHealth cards, as well as eDriving licenses. Because we work in an international environment every day, English is our official corporate language.

We are looking for a Site Reliability Engineer to join us at Thales and work with our Payment Solutions. The Site Reliability Engineer empowers product, delivery, and SRE teams to implement a holistic observability approach across AWS and GCP. We design observability standards, build reusable frameworks, and partner with teams to achieve end-to-end visibility, from Node.js and Java services to business outcomes. Our mission: make service performance measurable, detect incidents proactively, and accelerate investigations with trustworthy telemetry.

A day in the life of an SRE:

Build and maintain observability frameworks for AWS/GCP

  • Create reusable Datadog instrumentation for Node.js and Java
  • Provide auto-instrumentation templates and enforce observability quality standards
  • Publish Terraform modules for Datadog resources and cloud integrations

Own Datadog dashboards and measurement standards

  • Define and curate source-of-truth dashboards and KPIs
  • Establish golden signals and semantic conventions across services
  • Manage observability-as-code repos in GitLab

Improve monitoring, alerting, and incident readiness

  • Design precise, low-noise Datadog monitors and routing
  • Implement synthetics for critical flows and correlate with traces/logs
  • Partner with SREs on SLOs, error budgets, and incident triggers

Drive continuous learning and adoption

  • Turn post-incident learnings into improved monitors, dashboards, and CI/CD checks
  • Deliver training, documentation, and hands-on support for developers and SREs

Consult, enable, and optimize

  • Coach teams on instrumentation and APM best practices
  • Strengthen AWS/GCP observability integrations and tagging strategy
  • Optimize Datadog cost, sampling, retention, and cardinality; rationalize monitors

Typical interactions:

  • SRE: alert quality, troubleshooting, SLOs, post-incident reviews

  • Product/Dev: instrumentation, trace propagation, business KPIs

  • Platform/Infra: cloud integrations, Terraform, RBAC, cost/performance

  • Security/Compliance: telemetry governance, PII controls, retention policies

  • Leadership: service health roll-ups, reliability and adoption metrics

Skills & experience:

  • Strong engineering background in Node.js and/or Java (Datadog dd-trace, async context propagation, middleware patterns)

  • Cloud expertise in AWS — serverless, containers, managed services, and integrating cloud telemetry with Datadog

  • Automation skills with GitLab CI/CD and Terraform (Datadog resources, modules, workflows)

  • Datadog proficiency — APM, logs, metrics, synthetics, monitors, SLOs, and observability-as-code practices

  • Observability mindset — defining SLIs/SLOs, improving alert quality, and supporting the full incident lifecycle

  • Strong communication skills — clear documentation, training delivery, and confident English communication with distributed teams

At Thales we provide CAREERS and not only jobs. With Thales employing 80,000 employees in 68 countries, our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here, apply now!

Top Skills

AWS
Datadog
GCP
GitLab CI/CD
Java
Node.js
Terraform
The Company
Arlington, VA
63,258 Employees

What We Do

Thales is a global high technology leader investing in digital and “deep tech” innovations – connectivity, big data, artificial intelligence, cybersecurity and quantum technology – to build a future we can all trust, which is vital to the development of our societies. The company provides solutions, services and products that help its customers – businesses, organisations and states – in the defence, aeronautics, space, transportation and digital identity and security markets to fulfil their critical missions, by placing humans at the heart of the decision-making process.