Senior DevOps Engineer

Sorry, this job was removed at 04:12 p.m. (CST) on Friday, Jul 25, 2025
Be an Early Applicant
Hiring Remotely in São Paulo
In-Office or Remote
Artificial Intelligence • Machine Learning • Software
The Role
Engineering at TRACTIAN

The Engineering team at TRACTIAN builds and operates the cloud-native backbone that powers our industrial IoT platform. We design for massive scale, high reliability, and security across AWS, Azure AKS, and Oracle Cloud (OCI) Kubernetes clusters.

What you'll do

- Own end-to-end delivery pipelines—from GitHub commit to production—running on GitHub Actions, ECS Fargate, AKS, and OCI Kubernetes.
- Evolve our multi-cloud, multi-cluster architecture (AWS + OCI) with zero-trust networking.
- Write and maintain IaC (Terraform + Terragrunt), Helm charts, and Kubernetes operators to automate everything.
- Optimize observability: build dashboards/alerts using Grafana OSS stack, Prometheus, Loki, Tempo, and Datadog.
- Troubleshoot complex incidents involving microservices, monoliths in containers, and AI workloads on GPU nodes.
- Improve security posture—harden images, manage secrets, enforce policies, and audit compliance.
- Help other engineers on DevOps best practices and drive continuous improvement.

Responsibilities

  • Apply DevOps practices to increase deployment speed, security, and quality.
  • Architect and run CI/CD workflows in GitHub Actions (matrix builds, reusable workflows, OIDC federation).
  • Design, build, and maintain Terraform/Terragrunt modules for VPCs, subnets, security groups, side-to-side VPNs, and private links.
  • Manage container orchestration on ECS Fargate and Kubernetes (AWS & OCI) with Helm, Keda.
  • Implement autoscaling, blue-green / canary releases, and cost-optimization for GPU and CPU workloads.
  • Diagnose performance bottlenecks across network, compute, storage, and application layers.
  • Maintain high-quality documentation.

Requirements

  • B.S. in Computer Engineering, Information Systems, or equivalent experience.
  • Strong scripting skills (Python, Bash); Go or Rust a plus.
  • Hands-on CI/CD with GitHub Actions and experience running production workloads on:
  • AWS: ECS Fargate, S3, RDS, CloudWatch, VPC networking.
  • Kubernetes: OCI OKE, Helm, Istio, Keda.
  • IaC expertise with Terraform and Terragrunt in multi-account/multi-cloud setups.
  • Solid networking foundations: VPC design, subnets, routing, VPN/IPSec tunnels, security groups, load balancers.
  • Observability stack experience (Grafana, Prometheus, Loki, Tempo, Datadog).
  • Familiarity with container security, SBOMs, image scanning, secret management, and least-privilege IAM.
  • Excellent problem-solving skills, ownership mindset, and ability to work autonomously within a distributed team.

Similar Jobs

Motorola Solutions Logo Motorola Solutions

Senior Devops Engineer

Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
Remote or Hybrid
Brazil
23000 Employees

CSG Logo CSG

Senior Devops Engineer

Internet of Things • On-Demand • Payments • Software
Remote
2 Locations
5774 Employees

Truelogic Software Logo Truelogic Software

Senior Devops Engineer

Information Technology • Software
Remote
Brazil
266 Employees
In-Office or Remote
São Paulo, BRA
264 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Atlanta, , Georgia
103 Employees
Year Founded: 2019

What We Do

Tractian is a machine intelligence company that offers industrial monitoring systems. Tractian builds streamlined hardware-software solutions to give maintenance technicians and industrial decision-makers comprehensive oversight of their operations. It is democratizing access to sophisticated real-time monitoring and asset operations tools.

Tractian's solutions are used in environments that address a combined total of 5% of global industrial output. The company’s broad market reach is evidenced in its customer base from various industries, such as John Deere, Procter & Gamble, Caterpillar, Goodyear, Carrier, Johnson Controls, and Bimbo, the owner of the brands Little Bites and Thomas Bagels. Tractian's customers see a 6-12x ROI with savings of $6,000 per monitored machine annually on average.

In a major milestone and a first for the industry, Tractian launched the AI-Assisted Maintenance category in the industrial sector. In this new paradigm, artificial intelligence identifies machine problems and suggests preventive actions to be taken, giving invaluable insight and support to maintenance professionals. It is important to highlight that the intent of Assisted Maintenance is firmly rooted in augmenting maintenance professionals to provide more assertive diagnosis with human-in-the-loop feedback.

Tractian's mission is to elevate this category of workers in a highly impactful way. The Assisted Maintenance category will provide unimaginable support for maintenance professionals. By combining shop floor expertise with our technology, maintainers will be able to anticipate and address issues with unprecedented accuracy and speed

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account