Compute Infrastructure Specialist

Posted 24 Days Ago
Be an Early Applicant
San Francisco, CA, USA
In-Office
Mid level
Artificial Intelligence • Machine Learning • Software
The Role
Manage GPU infrastructure, coordinate internal and customer deployments, monitor performance, and collaborate with various teams while providing operational support.
Summary Generated by Built In
Compute Infra SpecialistRole Overview

We’re looking for a highly operational, technically savvy Compute Infra Specialist to help manage and scale the infrastructure that powers our AI workloads and customer deployments. This person will sit at the intersection of engineering, operations, and customer delivery, helping ensure GPU resources are efficiently allocated, deployments run smoothly, and customers have a strong experience using our infrastructure.

This is a hands-on role for someone who enjoys solving infrastructure problems, coordinating across teams, and working directly with cutting-edge AI systems.

Responsibilities
  • Manage and track GPU/compute inventory across internal and customer environments
  • Coordinate infrastructure provisioning for customer deployments and internal research workloads
  • Monitor utilization, capacity, uptime, and cost efficiency across compute environments
  • Work cross-functionally with Engineering, Research, Product, and GTM teams on deployment readiness and customer needs
  • Support customer onboarding and infrastructure troubleshooting alongside Solutions and Customer Success teams
  • Maintain documentation around infrastructure processes, environments, and deployment standards
  • Help improve operational workflows around provisioning, monitoring, escalation management, and forecasting
  • Partner with vendors and cloud providers as needed
  • Assist with infrastructure planning related to scaling customer demand and new product launches
Qualifications
  • Experience working with cloud infrastructure, GPU environments, or AI/ML infrastructure operations
  • Familiarity with Kubernetes, Linux environments, containerization, or cloud platforms (AWS/GCP/Azure)
  • Strong operational and project management instincts
  • Comfortable working cross-functionally in a fast-moving startup environment
  • Ability to communicate technical concepts clearly to both technical and non-technical stakeholders
  • Strong organizational skills and attention to detail
Nice to Have
  • Experience supporting AI/ML workloads or model deployment infrastructure
  • Experience working directly with enterprise customers
  • Startup experience

Skills Required

  • Experience working with cloud infrastructure, GPU environments, or AI/ML infrastructure operations
  • Familiarity with Kubernetes, Linux environments, containerization, or cloud platforms (AWS/GCP/Azure)
  • Strong operational and project management instincts
  • Ability to communicate technical concepts clearly
  • Strong organizational skills and attention to detail
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, California
48 Employees
Year Founded: 2023

What We Do

Arcee AI delivers purpose-built AI agents, powered by industry-leading small language models (SLMs) for enterprise applications. Their offering, Arcee Orchestra, is an end-to-end agentic AI solution that enables businesses to create AI agents for complex tasks. The solution makes it easy to build custom AI workflows that automatically route tasks to specialized SLMs to deliver detailed, trustworthy responses, fast.

Similar Jobs

CoreWeave Logo CoreWeave

Field Engineer

Cloud • Information Technology • Machine Learning
In-Office
6 Locations
1450 Employees
188K-275K Annually

Klaviyo Logo Klaviyo

Senior Solutions Architect

Consumer Web • eCommerce • Marketing Tech • Retail • Software • Analytics • Generative AI
Easy Apply
Hybrid
Los Angeles, CA, USA
2400 Employees
112K-168K Annually

Klaviyo Logo Klaviyo

Senior Solutions Architect

Consumer Web • eCommerce • Marketing Tech • Retail • Software • Analytics • Generative AI
Easy Apply
Hybrid
San Diego, CA, USA
2400 Employees
112K-168K Annually
Hybrid
San Mateo, CA, USA
205000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account