Platform Support Engineer

Posted 8 Days Ago
Hiring Remotely in India
Remote
Mid level
Big Data • Software • Analytics
The Role
Seeking a Cloud Platform Engineer to design, deploy, and operate Kubernetes clusters across multiple cloud platforms. Responsibilities include optimizing cluster performance, building cloud-native infrastructure, implementing automation, and ensuring security and compliance.

We're seeking a versatile Cloud Platform Engineer passionate about building and maintaining highly reliable, scalable, cloud-native infrastructure. You'll play a vital role in bridging the gap between development, operations, and SRE, ensuring our applications run smoothly on Kubernetes across multiple cloud platforms. Your deep understanding of Kubernetes, cloud technologies, and automation will be instrumental in empowering our teams to deliver high-quality software quickly and reliably.

What will you do?

  • Design, deploy, and operate Kubernetes clusters across AWS, Azure, and GCP. Optimize cluster performance, ensure high availability, and implement robust security practices.
  • Build and maintain cloud-native infrastructure components (load balancers, networking, storage, etc.) to support applications running on Kubernetes. Leverage Infrastructure as Code (IaC) with Terraform to automate and manage infrastructure provisioning and configuration.
  • Embrace GitOps principles using ArgoCD to automate deployments and configuration changes and ensure consistency between the desired and actual system state.
  • Establish comprehensive monitoring, logging, and alerting systems to gain insights into platform health and performance. Troubleshoot incidents swiftly and apply SRE principles to improve reliability and resilience.
  • Develop automation scripts and tools (Python, Go, or other languages) to streamline workflows, eliminate manual tasks, and reduce operational overhead.
  • Partner closely with development teams to understand their needs, provide guidance on platform best practices, and enable smooth integration and deployment of their applications.
  • Implement and maintain stringent security measures for Kubernetes and cloud environments, ensuring compliance with industry standards and data protection regulations.
  • Analyze resource usage and implement optimization strategies to maximize performance while controlling cloud costs.
  • Participate in an on-call rotation, troubleshooting and resolving production issues promptly.

What makes you a match?

  • 3+ years of experience working with Kubernetes in production environments. Deep understanding of cluster operations, networking, storage, and security within Kubernetes.
  • Strong knowledge of AWS, Azure, and GCP, including core services, networking concepts, and security best practices.
  • Proven experience implementing GitOps workflows with ArgoCD and managing infrastructure using Terraform.
  • Fluency in at least one programming language (Python, Go, Java) for automation, scripting, and tool development.
  • Familiarity with SRE practices like SLOs (Service Level Objectives), error budgeting, and blameless postmortems.
  • Excellent analytical and troubleshooting skills to identify and resolve issues in complex cloud environments.
  • Ability to communicate effectively with development, operations, and security teams to drive cross-functional initiatives.
  • Ability to work from 8:30 PM to 5:30 AM IST to provide coverage for US time zones.

Top Skills

Go
Python
The Company
HQ: New York, NY
192 Employees
On-site Workplace
Year Founded: 2018

What We Do

Built by a data team for data teams, Atlan is the active metadata platform for the modern data stack. It stitches together metadata from various sources (Snowflake, dbt, Databricks, Looker, Tableau, Postgres, etc.) to create a unified data discovery, cataloging, lineage, and governance experience across all your data assets, from columns and queries to metrics and dashboards. Atlan facilitates a two-way movement of metadata, bringing context back into the tools and workflows that your data team uses every day — for example, in your BI tool when you wonder what a metric on the dashboard means.

A pioneer in the space, Atlan was named a Leader in the Forrester Wave™: Enterprise Data Catalogs for DataOps in 2022 and was recognized by Gartner seven times in 2021, including as a Cool Vendor in DataOps and in the inaugural Market Guide for Active Metadata Management. Today, we power pioneering data teams like WeWork, Plaid, Postman, Unilever, and Ralph Lauren. We recently raised a Series B, backed by top investors (including Insight Partners, Sequoia, and Salesforce Ventures) and founders & CEOs from the modern data stack (including Snowflake, Looker, and Stitch).

For more information, visit http://www.atlan.com/ or follow us on Twitter at AtlanHQ.
