Sr. Cloud Engineer (Software-Focused)

Reposted 24 Days Ago
San Francisco, CA
In-Office
Senior level
Artificial Intelligence • Machine Learning • Software
The Role
The Senior Cloud Engineer will maintain and improve cloud infrastructure, support software deployments, enhance CI/CD pipelines, and ensure system reliability for various cloud providers.

Location: Remote or SF, CA
Department: Engineering
Reports to: Engineering Lead

About the Role

We’re looking for a Senior Cloud Engineer with a software engineering background to help build, scale, and support the infrastructure powering our applications. This is a hands-on role ideal for someone who enjoys working with Kubernetes and related tools across multiple cloud providers, and is excited to grow in a dynamic, fast-paced environment.

As a member of the Cloud team, you will work closely with software engineers and product teams to build platforms, support deployments, improve reliability, reduce costs, and help us scale our systems.

What You’ll Do
  • Work with the cloud team to maintain and improve the existing Kubernetes and AWS infrastructure.
  • Work with the software development and research teams to help them architect their applications and deploy them to the cloud.
  • Help build a platform to run our products inside private customer cloud environments in AWS, Azure, and GCP.
  • Support and improve CI/CD pipelines and automated deployments.
  • Support and improve the existing LGTM observability stack.
  • Write clean, maintainable scripts and tooling in Python, Go, or similar languages.
  • Contribute to the design and automation of scalable, resilient, and secure systems.
  • Help triage and resolve infrastructure-related issues in staging and production environments.
  • Participate in the on-call rotation (as needed) and contribute to system reliability initiatives.
  • Assist with SOC 2 audits.
Tools you should know well
  • AWS (Control Tower, Identity Center, VPC, and more)
  • Kubernetes (EKS)
  • Terraform and Terragrunt
  • Helm
  • Docker
  • ArgoCD
  • LGTM (Loki, Grafana, Tempo, Mimir)
  • Python
Nice-to-have skills (not required)
  • Experience working with research and software teams.
  • Experience using or designing agentic systems.
  • Experience with using LLMs or building systems that use LLMs.
  • Experience running GPU workloads on Kubernetes would be a huge plus.
Nice-to-have tools (not required)
  • Pulumi
  • Atlantis
  • Zitadel
  • GCP
  • Azure
  • Google Workspace
  • Fivetran
  • Cloudflare
  • Snowflake
  • PostgreSQL
  • Tailscale
  • JavaScript/TypeScript
  • Docker Compose
  • Redis / Valkey
Why Join Us?

At Arcee, we’re building the infrastructure powering the next generation of intelligent systems, and we’re doing it with a team that values curiosity, ownership, and thoughtful collaboration.

  • Work on high-impact problems: You’ll tackle real infrastructure challenges that support AI research, agentic systems, and production ML workflows across AWS, Azure, and GCP.

  • Join a sharp, mission-driven team: Our engineers are deeply technical and collaborative, and we care about doing things the right way, not just the fast way.

  • Grow with autonomy and impact: We’re still small, which means your voice matters. You’ll shape strategy, ship real things, and see your work in action.

  • Remote-first, with roots in SF: We support remote work and async collaboration, and we’re opening an office in San Francisco for those who prefer a hybrid setup.

  • Take the time you need: We offer unlimited PTO plus US bank holidays, and we genuinely want you to take the time off. Rested teams do better work.

  • Be part of something future-facing: Our work directly supports large language models and intelligent agents. You'll be at the intersection of infrastructure and innovation.

Top Skills

ArgoCD
AWS
Docker
Helm
Kubernetes
Python
Terraform
Terragrunt

The Company
HQ: San Francisco, California
48 Employees
Year Founded: 2023

What We Do

Arcee AI delivers purpose-built AI agents, powered by industry-leading small language models (SLMs) for enterprise applications. Their offering, Arcee Orchestra, is an end-to-end agentic AI solution that enables businesses to create AI agents for complex tasks. The solution makes it easy to build custom AI workflows that automatically route tasks to specialized SLMs to deliver detailed, trustworthy responses, fast.
