Engineer II - Site Reliability (Hybrid, IND)

Posted 3 Hours Ago
Be an Early Applicant
Bangalore, Bengaluru Urban, Karnataka, IND
Hybrid
Mid level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Define your future at CrowdStrike.
The Role
Operate and evolve internal Temporal infrastructure on Kubernetes: deploy updates, automate operational tasks, tune performance and capacity, build observability, participate in on-call incident response, troubleshoot production issues, and help onboard internal teams while growing platform engineering skills.
Summary Generated by Built In

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About This Role:

CrowdStrike's engineering organization depends on shared infrastructure platforms that power critical product capabilities. The Temporal Platform team owns a production workflow orchestration system that serves engineering teams across the organization.

You'll help operate and evolve our internal Temporal infrastructure, a stateful, distributed system running on Kubernetes across multiple regions. The work spans day to day operations, automation, performance tuning and capacity planning. You'll learn how to run complex infrastructure at scale while working alongside experienced platform engineers who will help you grow into broader ownership over time.

This is a growth oriented role. We're looking for someone early in their platform engineering journey who's ready to build operational depth, develop automation skills and understand what it takes to run production infrastructure that teams depend on.

What You'll Do:

  • Operate Temporal infrastructure in production - deploy updates, monitor cluster health, respond to alerts, and maintain availability across multiple environments using Helm, Kubernetes and FluxCD

  • Automate operational work - write scripts and workflows that make deployments, upgrades, scaling operations, and troubleshooting repeatable and safe; reduce manual toil over time

  • Support capacity planning and performance tuning - track resource utilization, identify bottlenecks, tune configuration for better performance and contribute to capacity forecasts under guidance

  • Build observability - instrument services with metrics and logs, improve dashboards, and refine alerting so the team can catch problems before they impact users

  • Contribute to on call rotation - participate in incident response, learn how to triage and escalate issues effectively, write runbooks that help the next person on-call

  • Learn GitOps workflows - work with FluxCD to manage infrastructure-as-code, submit pull requests for configuration changes, and understand how declarative deployment pipelines work

  • Troubleshoot operational issues - investigate deployment failures, connectivity problems, performance degradations, and work with teammates to determine root cause and preventive fixes

  • Partner with consuming teams - help internal engineers onboard to Temporal, answer questions, debug integration issues, and contribute to documentation that makes adoption easier

  • Grow your infrastructure skills - work with PostgreSQL, AWS/GCP, Kubernetes networking, Helm chart management, certificate rotation, secret management and distributed systems operations under mentorship

What You'll Need:

  • 3+ years in DevOps, SRE, platform engineering or infrastructure roles - you've worked on production systems and understand the basics of running services reliably

  • Kubernetes fundamentals - you've deployed services to Kubernetes, understand pods/deployments/services, and can debug basic cluster issues; you don't need deep expertise but should be comfortable navigating kubectl and reviewing YAML

  • Helm experience - you've used Helm to deploy applications, understand charts and values files, and can troubleshoot failed releases

  • Some infrastructure-as-code experience - you've used tools like Terraform, Ansible, or GitOps workflows (FluxCD, ArgoCD) to manage infrastructure declaratively rather than clicking in consoles

  • Cloud platform exposure - you've worked with AWS or GCP in some capacity; you understand basic compute, networking, and storage primitives but don't need to be an expert

  • Scripting ability - you can write scripts (Bash, Python, Go) to automate repetitive tasks and build simple tooling

  • Basic understanding of stateful systems - you've worked with databases (PostgreSQL preferred) or other persistent services and understand backups, schema management, and connection handling at a foundational level

  • Willingness to learn and ask for help - you're comfortable saying "I don't know" and diving into unfamiliar territory with support from teammates

What Success Looks Like:

In your first few months:

  • You can deploy Temporal upgrades across environments with confidence

  • You've automated at least one recurring operational task

  • You respond to on-call pages effectively and write clear incident summaries

  • You've contributed meaningful improvements to dashboards or runbooks

  • Internal teams reach out to you directly for help with Temporal questions

Over your first year:

  • You own end-to-end operations for specific Temporal components or environments

  • You proactively identify performance issues and propose tuning strategies

  • You're contributing to capacity planning and cost optimization discussions

  • You're helping onboard new engineers to the team's operational practices

Bonus Points:

  • Experience operating workflow orchestration platforms (Temporal, Airflow, Prefect, Cadence)

  • Experience with FluxCD or ArgoCD in production

  • Exposure to distributed tracing or observability platforms

  • Go experience (our services and many consuming applications are written in Go)

  • Previous work on internal platform teams or DevOps infrastructure roles

  • Understanding of PostgreSQL performance tuning and operational best practices

  • Familiarity with multi-region infrastructure deployment and failover patterns

#LI-SM2

Benefits of Working at CrowdStrike:

  • Market leader in compensation and equity awards

  • Comprehensive physical and mental wellness programs

  • Competitive vacation and holidays for recharge

  • Paid parental and adoption leaves

  • Professional development opportunities for all employees regardless of level or role

  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections

  • Vibrant office culture with world class amenities

  • Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at [email protected] for further assistance.

Skills Required

  • 3+ years in DevOps, SRE, platform engineering, or infrastructure roles
  • Kubernetes fundamentals (deployments, pods, services, kubectl, YAML)
  • Helm experience (charts and values files, troubleshoot releases)
  • Infrastructure-as-code experience (Terraform, Ansible, or GitOps workflows like FluxCD/ArgoCD)
  • Cloud platform exposure (AWS or GCP)
  • Scripting ability to automate tasks (Bash, Python, or Go)
  • Basic understanding of stateful systems and databases (PostgreSQL preferred)
  • Willingness to learn, ask for help, and operate in a growth-oriented environment
  • Experience operating workflow orchestration platforms (Temporal, Airflow, Prefect, Cadence)
  • Experience with FluxCD or ArgoCD in production
  • Exposure to distributed tracing or observability platforms
  • Go experience (systems or application development)
  • PostgreSQL performance tuning and operational best practices
  • Familiarity with multi-region deployment and failover patterns
  • Previous work on internal platform teams or DevOps infrastructure roles

What the Team is Saying

Andrew C.
Lauren P.
Brian P.
Alexa Z.
Theo K.
Sara I.
Lam N.
Lauren B.
Adeeb C.
Kristan C.
Alena C.
Thaddeus M.
Alyssa J.
KT T.

CrowdStrike Compensation & Benefits Highlights

  • Equity Value & Accessibility Equity is emphasized through RSUs and an ESPP with a lookback discount. Feedback suggests these stock programs are considered meaningful parts of total compensation.
  • Healthcare Strength Health coverage encompasses medical, dental, vision, mental‑health resources, and FSAs/HSAs. Feedback suggests these offerings are positioned as comprehensive across official materials and benefit listings.
  • Leave & Time Off Breadth Time off includes generous or “unlimited” PTO, paid holidays, volunteer time, and “Birthday PTO.” Feedback suggests these policies are presented as standard parts of the package.

CrowdStrike Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Austin, TX
10,000 Employees
Year Founded: 2011

What We Do

CrowdStrike has redefined security with the world’s most advanced cloud-native platform that protects and enables the people, processes and technologies that drive modern enterprise. Tested and proven, the world's largest organizations trust CrowdStrike to stop breaches with unparalleled protection against the most sophisticated cyberattacks. The CrowdStrike culture has been built upon our Core Values since the day we began. We are Fanatical About the Customer, Relentlessly Focused on Innovation and believe that our Limitless Passion drives Unlimited Potential for every CrowdStriker. As a purpose-built remote-first company, we believe cultivating a connected culture for every employee, no matter where they are in the world, is a key ingredient in building a high-performing, diverse team. We don’t have a mission statement. We’re on a mission—to stop breaches. Ready to join a mission that matters?

Why Work With Us

We have a culture that celebrates achievement, encourages flexibility and innovation and thrives on teamwork. We all work towards a single mission: to stop breaches. This common goal drives a sense of community and connection among our people across the globe.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

CrowdStrike Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
HQAustin, TX
Osaka
Aarhus, DK
Arlington, VA
Barcelona, ES
Bengaluru, IN
Brussels, BE
Bucharest, RO
Cheltenham, GB
Copenhagen, DK
Dubai, Dubai
Irvine, CA
Kirkland, WA
Minneapolis, MN
Mumbai, IN
New Delhi, IN
Pune, IN
Reading, GB
Riyadh, SA
Saint Louis, MO
Singapore
Sunnyvale, CA
Sydney, Sydney
Tel Aviv-Yafo, IL
Tokyo, Japan
Learn more

Similar Jobs

CrowdStrike Logo CrowdStrike

Sales Development Representative

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
2 Locations
10000 Employees

CrowdStrike Logo CrowdStrike

Sales Development Representative

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
2 Locations
10000 Employees

CrowdStrike Logo CrowdStrike

Account Executive

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
India
10000 Employees

CrowdStrike Logo CrowdStrike

Back-end Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
10000 Employees
5-5 Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account