Senior Cloud Infrastructure Engineer

Posted 2 Days Ago
Hiring Remotely in United States
Remote
158K-216K Annually
Senior level
Sales • Software
The Role
The Senior Cloud Infrastructure Engineer will build scalable, secure infrastructure, collaborate with ML teams, and ensure system reliability through best practices and observability.
Summary Generated by Built In

Senior Cloud Infrastructure Engineer

About the Role

We’re looking for a Senior Cloud/DevOps Engineer to join Hatch’s high-impact engineering team. This is a senior-level role focused on building resilient, secure, and scalable infrastructure to support both our core platform and AI-powered product lines. You'll partner with engineers, ML practitioners, and product leaders to ensure our systems can scale with the speed of our ambitions.

About Hatch
Hatch is a fast-moving team of builders solving real-world business problems with AI. We move quickly, take ownership, and care deeply about delivering outcomes. Our engineering culture prioritizes operational rigor, clean architecture, and velocity without compromising reliability. If you're energized by scale, speed, and owning infrastructure that powers AI workflows end-to-end — this is a role for you.

What You’ll Do
Infrastructure at Scale
•Evolve our cloud infrastructure (AWS & GCP) using infrastructure-as-code tools like
Terraform or Ansible.
•Implement systems that support the compute-heavy and storage-intensive needs
of machine learning and data processing pipelines.
•Manage scalable, secure, and cost-efficient environments across dev, staging, and
production.
•Participate in an on-call rotation.


ML Platform Support
•Collaborate with ML engineers to productionize models and manage workflows
across training, testing, and deployment stages.
•Implement infrastructure to support versioning, orchestration, and monitoring of
ML models in production (e.g. using tools like Kubeflow, SageMaker, VertexAI, or
custom pipelines).
•Optimize data pipelines and model serving infrastructure for low-latency and high-
throughput performance.


Reliability & Observability
•Drive the strategy for observability, logging, and alerting across distributed
systems.

•Lead incident response, root cause analysis, and system hardening for long-term
resiliency.
•Implement best practices for infrastructure security, container hardening, and
network architecture.


Platform Enablement
•Partner with engineering teams to bake DevOps best practices into the
development lifecycle.
•Build tooling and automation that improves developer velocity, release stability,
and system transparency.


What We’re Looking For
•5+ years of experience in DevOps, SRE, or platform engineering roles in high-
growth environments.
•3+ years of experience with AWS infrastructure and services, including networking,
IAM, ECS/EKS, and serverless computing.
•Strong experience with infrastructure-as-code (Terraform, Ansible) and CI/CD
tooling (GitHub Actions, ArgoCD, etc.).
•Experience supporting machine learning teams or MLOps platforms (e.g. model
training pipelines, feature stores, model registry, online inference).
•Strong knowledge of container orchestration (Kubernetes preferred) and
observability stacks (Prometheus, Grafana, Sentry, DataDog, New Relic, etc.).
•Proven ability to participate in architectural conversations and contribute to large-
scale infrastructure improvements.
•A bias toward simplicity, security, and reliability — you know when to build fast and
when to build right.
•Familiarity with at least one programming language; Python, Go, Erlang, Rust, etc.
•Exposure to agentic programming workflows.
•RHCE, RHCSA, or equivalent certifications preferred.


Why You Should Join
•Work at the intersection of infrastructure and machine learning at a company
building real AI products with urgency and purpose.
•Join a culture that expects technical leadership, fast decision-making, and
relentless curiosity.
•Partner with high-caliber engineers and product leaders in a tight-knit, fast-
executing environment

What We Offer
  • Competitive salary

  • Remote work environment

  • Medical, dental, and vision benefits

  • 401(k) plan + Match

  • Flexible PTO

  • Opportunity to build at the ground floor of a high-growth, mission-driven company

Closing

  • There are a variety of factors that go into determining a salary range, including but not limited to external market benchmark data, geographic location, and years of experience.

  • Based on the anticipated level of experience we are seeking, we expect the compensation range for this role to be between $158,000 and $216,000.

  • We will consider for employment qualified candidates with arrest and conviction records, consistent with applicable law (including, for example, the San Francisco Fair Chance Ordinance for roles based in San Francisco, the Los Angeles County Fair Chance Ordinance for roles based in the unincorporated areas of Los Angeles County, and the California Fair Chance Act for roles based in California).

  • Where required by law, a criminal background check will not be conducted until after a conditional offer of employment is made, and any evaluation of a candidate's criminal background check will be subject to an individualized assessment that takes into account the candidate's specific criminal records and the responsibilities and requirements of the particular role.

Recruiting and Applicant Privacy Notice

Top Skills

Ansible
Argocd
AWS
Datadog
Erlang
GCP
Github Actions
Go
Grafana
Kubeflow
Kubernetes
New Relic
Prometheus
Python
Rust
Sagemaker
Sentry
Terraform
Vertexai
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Richmond, VA
59 Employees
Year Founded: 2016

What We Do

Hatch is an SMS Texting Platform that helps small and medium-size businesses harness the power of two-way SMS messaging.

Similar Jobs

Valon Logo Valon

Infrastructure Engineer

Fintech • Real Estate
Remote or Hybrid
USA
500 Employees
200K-245K Annually

Fingerprint Logo Fingerprint

Infrastructure Engineer

Information Technology • Security • Software • Cybersecurity
Remote
USA
115 Employees
178K-205K Annually

AIS (Applied Information Sciences) Logo AIS (Applied Information Sciences)

Infrastructure Engineer

Cloud • Information Technology • Software • Business Intelligence
In-Office or Remote
2 Locations
710 Employees
102K-154K Annually
Easy Apply
Remote
USA
1535 Employees
90-114 Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account