Staff Software Engineer, Compute

Reposted 8 Days Ago
Be an Early Applicant
San Francisco, CA
In-Office
180K-250K
Expert/Leader
Cloud • Digital Media • Information Technology
Generative media platform for developers.
The Role
As a Staff Software Engineer, you will develop and maintain a core Python platform for managing computation workloads and cloud infrastructure, while ensuring system reliability and scalability.
Summary Generated by Built In

You are an experienced software engineer who thrives on building large scale computation platforms. You have deep expertise in backend systems that orchestrate workloads and route requests efficiently, while taking care of capacity and resource constraints. You possess a strong understanding of foundational cloud infrastructure and Linux provisioning and management tools. You know how to achieve reliability and scale with minimum operational load.

Key responsibilities
  • Develop and maintain our core Python platform, which handles routing of requests, orchestration of AI workloads, GPU server capacity management, observability, authentication, rate limiting, and many others

  • Develop and maintain our infrastructure layer where we use Terraform, Ansible, and provider APIs to manage our fleet of GPU workers

  • Own K8s, FluxCD, Nomad, Prometheus, Thanos, Grafana, Loki, distributed networking storage, and other technologies that underpin our platform

  • Create the vision and lay the foundation for where our infrastructure should go in the next 1/2/5 years

Requirements
  • Deep experience building distributed compute platforms, preferably with Python

  • Strong foundation in managing both cloud and bare metal infrastructure

  • Solid understanding of K8s and CI/CD on it

  • Excellent communication

  • Self-starter who executes quickly, takes ownership and constantly seeks improvement

Compensation
  • $180,000-250,000 plus equity
Location
  • San Francisco, CA

What we offer at fal
  • Interesting and challenging work

  • Employee-friendly equity terms (early exercise, extended exercise)

  • A lot of learning and growth opportunities

  • We are currently hiring in downtown San Francisco.

  • We offer visa sponsorship and will help you relocate to San Francisco.

  • Health, dental, and vision insurance (US)

  • Regular team events and offsites

Top Skills

Ansible
Fluxcd
Grafana
Kubernetes
Loki
Prometheus
Python
Terraform
Thanos
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
73 Employees

What We Do

Generative Media Cloud

Similar Jobs

NVIDIA Logo NVIDIA

Staff Software Engineer

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
In-Office or Remote
2 Locations
21960 Employees
168K-265K

ZS Logo ZS

Consultant

Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
Hybrid
3 Locations
13000 Employees
143K-175K Annually

Atlassian Logo Atlassian

Software Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
11000 Employees
123K-193K Annually

Atlassian Logo Atlassian

Software Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
In-Office or Remote
San Francisco, CA, USA
11000 Employees
123K-193K Annually

Similar Companies Hiring

Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account