Software Development Engineer III - Infrastructure

Reposted 17 Days Ago
Hiring Remotely in Delhi, Connaught Place, New Delhi, Delhi
In-Office or Remote
Mid level
Information Technology • Internet of Things • Marketing Tech
The Role
Join the Core Infrastructure SRE Operations & Security team to manage production systems, incident response, reliability engineering, and security operations. Collaborate with various teams to ensure system stability and improve operational processes.
About HighLevel:
HighLevel is an AI-powered, all-in-one white-label sales & marketing platform that empowers agencies, entrepreneurs, and businesses to elevate their digital presence and drive growth. We are proud to support a global and growing community of over 2 million businesses, comprising agencies, consultants, and businesses of all sizes and industries. HighLevel gives users all the tools needed to capture and nurture new leads and convert them into repeat customers. As of mid-2025, HighLevel processes over 4 billion API hits and handles more than 2.5 billion message events every day. Our platform manages over 470 terabytes of data distributed across five databases, operates with a network of over 250 microservices, and supports over 1 million hostnames.

Our People
With over 1,500 team members across 15+ countries, we operate in a global, remote-first environment. We are building more than software; we are building a global community rooted in creativity, collaboration, and impact. We take pride in cultivating a culture where innovation thrives, ideas are celebrated, and people come first, no matter where they call home.

Our Impact
As of mid-2025, our platform powers over 1.5 billion messages, helps generate over 200 million leads, and facilitates over 20 million conversations each month for the more than 2 million businesses we serve. Behind those numbers are real people growing their companies, connecting with customers, and making their mark, and we get to help make that happen.

About the Role:
We are seeking SDE3 engineers to join HighLevel’s Core Infrastructure SRE Operations & Security team. This role focuses on operating, securing, and improving HighLevel’s production infrastructure, with responsibilities spanning on-call operations, incident response, reliability engineering, and security remediation.

You will work closely with Cloud Infrastructure, Platform Engineering, Data Infrastructure, and Security teams to ensure systems are stable, resilient, and secure. This is a hands-on role with a strong operational and security mindset, critical to HighLevel’s platform maturity.

Responsibilities:

Production Operations & Reliability:
-> Participate in 24/7 on-call rotations for core infrastructure systems
-> Execute incident response during production events, including triage, mitigation, and recovery
-> Maintain and improve runbooks, operational procedures, and escalation paths
-> Help reduce MTTR and prevent repeat incidents through engineering solutions

Infrastructure Reliability Engineering:
-> Improve reliability of core infrastructure components, including Kubernetes (GKE) clusters, cloud networking and load balancing, and edge services (Cloudflare)
-> Identify systemic reliability issues and drive corrective actions
-> Support capacity planning, scaling, and resilience testing

Security Operations & Remediation:
-> Execute security remediations across cloud and Kubernetes environments
-> Support enforcement of IAM least-privilege access, network security controls, and runtime security policies
-> Partner with Platform Security on vulnerability management and remediation
-> Support security incident response and post-incident reviews

Automation & Tooling:
-> Automate repetitive operational and security tasks
-> Build tooling to improve incident response speed, operational visibility, and security posture enforcement
-> Reduce manual toil through scripts, tooling, and process improvements

Change Management & Governance:
-> Support safe execution of infrastructure and configuration changes
-> Ensure changes follow defined change management and audit requirements
-> Contribute to incident reviews, postmortems, and continuous improvement initiatives

Collaboration & Growth:
-> Work closely with Cloud Infrastructure, SRE, Platform, Data, and Security teams
-> Contribute to shared documentation and operational standards
-> Mentor junior engineers and lead small reliability or security initiatives

Requirements:

  • 4+ years of experience operating large-scale systems
  • Experience with GCP or other public cloud platforms
  • Experience with Kubernetes (GKE) in production
  • Ability to identify systemic issues and propose long-term fixes
  • Experience leading incident response or reliability initiatives
  • Strong understanding of reliability, security, and operational best practices
  • Comfortable working in on-call and incident response environments
  • Strong troubleshooting and communication skills
  • Experience supporting or operating production systems
  • Comfortable mentoring junior engineers and influencing peers

Nice to have:

  • Familiarity with Cloudflare, networking, or edge security
  • Exposure to security tooling or vulnerability management
  • Scripting or automation experience (Python, Go, Bash, etc.)
  • Experience in compliance- or audit-driven environments (SOC2, ISO)

EEO Statement:
The company is an Equal Opportunity Employer. As an employer subject to affirmative action regulations, we invite you to voluntarily provide the following demographic information. This information is used solely for compliance with government record-keeping, reporting, and other legal requirements. Providing this information is voluntary and refusal to do so will not affect your application status. This data will be kept separate from your application and will not be used in the hiring decision.


Top Skills

Bash
Cloudflare
GKE
Go
Kubernetes
Python
Security Tooling

The Company
Dallas, Texas
974 Employees
Year Founded: 2018

What We Do

https://www.gohighlevel.com/quick-links

One white-labeled marketing app to rule them all. HighLevel is everything your business needs to succeed!

Capture leads using our landing pages, surveys, forms, calendars, inbound phone system & more!

Automatically message leads via voicemail, forced calls, SMS, emails, FB Messenger & more!

Use our built-in tools to collect payments, schedule appointments, and track analytics.
