DevOps Engineer

Posted 2 Days Ago
Be an Early Applicant
27 Locations
Remote
Senior level
Artificial Intelligence • Software
The Role
Design, build, and operate cloud infrastructure and Kubernetes clusters for GPU/ML workloads; implement GitOps, IaC (Terraform), CI/CD, monitoring, automation (Python/Bash), and cost-optimization alongside ML engineers.
Summary Generated by Built In
About Fundamental

Fundamental is an AI company pioneering the future of enterprise decision-making. Founded by DeepMind alumni, Fundamental has developed NEXUS – the world's most powerful Large Tabular Model (LTM) – purpose-built for the structured records that actually drive enterprise decisions. Backed by world class investors and trusted by Fortune 100 companies, Fundamental unlocks trillions of dollars of value by giving businesses the Power to Predict.

At Fundamental, you'll work on unprecedented technical challenges in foundation model development and build technology that transforms how the world's largest companies make decisions. This is your opportunity to be part of a category-defining company from the ground-up. Join the team defining the future of enterprise AI.

Key responsibilities
  • Design and implement cloud infrastructure from the ground up

  • Build and maintain Kubernetes clusters optimized for GPU workloads and ML applications, as well as Production SaaS hosting

  • Implement GitOps practices using ArgoCD for continuous deployment

  • Develop infrastructure as code using Terraform

  • Create and maintain CI/CD pipelines for infrastructure and application deployment

  • Implement monitoring and observability solutions for distributed systems

  • Automate infrastructure management with Python and Bash

  • Collaborate with ML engineers to optimize infrastructure for model training and serving

  • Implement and maintain cost optimization strategies (FinOps) for cloud resources

  • Monitor and optimize cloud spending, especially for GPU-intensive workloads

Must have
  • 5+ years of experience in cloud infrastructure and DevOps

  • 3+ years of experience with Python

  • Strong experience with AWS and GCP cloud platforms

  • Deep expertise in Kubernetes, including multi-cluster management, GPU workload optimization, resource scheduling and autoscaling, and network policies and security

  • Experience with GitOps tools (ArgoCD preferred)

  • Extensive experience with cloud networking, including VPC design, load balancer configuration, network security and segmentation, and cross-cloud networking solutions

  • Strong CI/CD expertise, preferably with GitHub Actions

  • Proficiency in infrastructure as code (Terraform)

  • Experience with monitoring and observability tools

  • Experience with FinOps practices and cloud cost optimization

Nice to have
  • Experience with ML workflow tooling (MLflow, Kubeflow, or similar)

  • Experience with FastAPI and Backend applications

  • Familiarity with data platforms like Databricks or Snowflake

  • Exposure to SRE practices or cloud security certifications

  • Hands-on experience with Prometheus, Grafana, or Datadog

Benefits
  • Competitive compensation with salary and equity

  • Comprehensive health coverage for you and your dependents

  • Paid parental leave for all new parents, inclusive of adoptive and surrogate journeys

  • Relocation support for employees moving to join the team in one of our office locations

  • A mission-driven, low-ego culture that values diversity of thought, ownership, and bias toward action

Skills Required

  • 5+ years of experience in cloud infrastructure and DevOps
  • 3+ years of experience with Python
  • Strong experience with AWS and GCP cloud platforms
  • Deep expertise in Kubernetes, including multi-cluster management, GPU workload optimization, resource scheduling and autoscaling, and network policies and security
  • Experience with GitOps tools (ArgoCD preferred)
  • Extensive experience with cloud networking, including VPC design, load balancer configuration, network security and segmentation, and cross-cloud networking solutions
  • Strong CI/CD expertise, preferably with GitHub Actions
  • Proficiency in infrastructure as code (Terraform)
  • Experience with monitoring and observability tools
  • Experience with FinOps practices and cloud cost optimization
  • Experience with ML workflow tooling (MLflow, Kubeflow, or similar)
  • Experience with FastAPI and Backend applications
  • Familiarity with data platforms like Databricks or Snowflake
  • Exposure to SRE practices or cloud security certifications
  • Hands-on experience with Prometheus, Grafana, or Datadog
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
54 Employees
Year Founded: 2024

What We Do

For decades companies have relied on archaic tools to inform decisions and make bets on the future. Until now. Fundamental empowers businesses to turn gambles into guarantees and determine their future with far greater accuracy than ever before. Built by DeepMind alumni and trusted by Fortune 100 enterprises, NEXUS is our most powerful Large Tabular Model (LTM). By revealing the hidden language of tables, NEXUS unlocks trillions of dollars of value by giving businesses the Power to Predict™.

Similar Jobs

Satori Analytics Logo Satori Analytics

Devops Engineer

Artificial Intelligence • Information Technology • Machine Learning • Software • Analytics
Remote
Greece
94 Employees

Sphynx Technology Solutions Logo Sphynx Technology Solutions

Devops Engineer

Software • Analytics • Cybersecurity
In-Office or Remote
2 Locations
38 Employees

EUROPEAN DYNAMICS Logo EUROPEAN DYNAMICS

Devops Engineer

Information Technology • Consulting
In-Office or Remote
2 Locations
765 Employees

RevenueCat Logo RevenueCat

Senior DevOps / DevEx Engineer

Fintech • Mobile • Payments • Software
Remote
44 Locations
40 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account