The Role
Design, implement, and secure scalable cloud infrastructure across AWS and OCI; manage CI/CD (GitHub Actions), Kubernetes and ECS clusters (including GPU workloads); integrate observability (Datadog, Grafana, OpenTelemetry, Sentry); enforce security and compliance (SOC2, ISO 27001); handle DevOps/SRE intake, vulnerability response, and migrations from EKS to OKE.
Summary Generated by Built In
Senior Cloud Engineer at TRACTIAN
In a data-driven company like TRACTIAN, the Cloud Engineering team is essential for maintaining robust, secure, and scalable cloud infrastructures. This team implements automation, security practices, and rigorous protocols to safeguard our digital assets and data infrastructure across diverse cloud environments. The Cloud Engineering team plays a crucial role in our internal operations and client solutions by ensuring continuous integration, secure deployments, and advanced observability.
As a Senior Cloud Engineer, you will work within a technical team to safeguard the company's cloud infrastructure, primarily on AWS and OCI, with occasional projects involving GCP and Azure. You will implement state-of-the-art infrastructure solutions, embed robust security measures, and ensure efficient deployment processes. This position requires deep technical expertise and a hands-on approach to infrastructure automation, security integration, and observability.
Responsibilities:
- Architect, implement, and secure scalable cloud infrastructure on AWS, OCI, and occasionally GCP/Azure.
- Oversee CI/CD pipelines, enhancing them through GitHub Actions and GitHub Enterprise.
- Maintain and optimize Kubernetes clusters and AWS ECS environments, including GPU infrastructure management.
- Embed comprehensive security measures, integrating advanced security tools and practices proactively.
- Implement observability and monitoring solutions with Datadog, Grafana, OpenTelemetry, and Sentry.
- Use Jira for project management and issue tracking.
- Collaborate closely with other engineering teams to drive secure and efficient development practices.
- Address vulnerabilities, security incidents, and tickets promptly and proactively.
- Field DevOps / SRE intake queues.
- Execute Kubernetes service migrations from AWS (EKS) to OCI (OKE), ensuring workload compatibility, stability, and minimal disruption.
Requirements:
- 5+ years of hands-on experience in Cloud Engineering, DevSecOps, or similar roles.
- Extensive knowledge of AWS and OCI; familiarity with GCP/Azure preferred.
- Strong working knowledge of Kubernetes (k8s), including cluster management, pod architecture, and GPU-based workloads; CKA or CKAD certification a plus.
- Expert in Terraform (primary IaC tool), Helm, Docker, and AWS ECS.
- Strong experience with GitHub Actions, GitHub Enterprise, and Cloudflare.
- Proficiency in monitoring tools including Datadog, Grafana, OpenTelemetry, and Sentry.
- Solid understanding of security best practices and compliance frameworks including SOC2 and ISO 27001.
- Strong scripting skills in Python, Bash, or PowerShell for automation purposes.
- Docker Kompose experience a plus.
Preferred Qualifications:
- Certifications in AWS, OCI, Kubernetes (CKA, CKAD), or relevant cloud engineering certifications.
- Prior experience in high-growth tech environments.
Why Join Us:
- Opportunity to lead and directly influence infrastructure and security strategy.
- Innovative and challenging technical environment.
- Continuous learning and career growth opportunities.
- Competitive Salary
- Premium Medical, Dental, and Vision Coverage
- Paid Time Off (PTO): 15 Days
- 401(k) Retirement Plan
- Wellhub Membership - Access a wide range of gyms and training programs.
- Sports Incentive - Receive a monthly bonus when you regularly participate in physical activities.
- Long-Term Benefit - After four years of service, earn a fully funded trip anywhere in the world.
Skills Required
- 5+ years hands-on Cloud Engineering, DevSecOps, or similar roles
- Extensive knowledge of AWS
- Extensive knowledge of OCI
- Familiarity with GCP
- Familiarity with Azure
- Strong working knowledge of Kubernetes, including cluster management and GPU workloads
- CKA or CKAD certification
- Expertise in Terraform
- Experience with Helm
- Experience with Docker
- Docker Kompose experience
- Experience with AWS ECS
- Experience managing GPU infrastructure
- Experience with GitHub Actions and GitHub Enterprise
- Experience with Cloudflare
- Proficiency with Datadog
- Proficiency with Grafana
- Proficiency with OpenTelemetry
- Proficiency with Sentry
- Experience using Jira for project management and issue tracking
- Knowledge of security best practices and compliance frameworks including SOC2 and ISO 27001
- Strong scripting skills in Python, Bash, or PowerShell
- Ability to execute Kubernetes service migrations from EKS to OKE
- Certifications in AWS, OCI, or Kubernetes
- Prior experience in high-growth tech environments
The Company
What We Do
Tractian is a machine-intelligence company delivering integrated hardware, cloud software and AI to prevent machine failures and boost industrial uptime. Their offering combines vibration and condition sensors, TracOS maintenance-management software, and AI-driven analytics to enable predictive maintenance, energy optimization and operational visibility for factories and asset-heavy operations globally.








