AI Infrastructure Engineer

Posted 9 Hours Ago
Hiring Remotely in United States
Remote
170K-210K Annually
Senior level
Artificial Intelligence • Software • Energy • Utilities
The Role
The AI Infrastructure Engineer designs and builds infrastructure for AI and ML models across various environments, optimizing performance and reliability.
Summary Generated by Built In
Utilidata is a fast-growing NVIDIA-backed edge AI company enabling greater visibility and control of power utilization in energy-intensive infrastructure, like the electric grid and data centers. Karman, the company’s distributed AI platform powered by a custom NVIDIA module, is transforming the way utility companies operate the grid edge and will enable data centers to unlock more compute for the same provisioned power.
The AI Infrastructure Engineer is responsible for designing, building, and owning the end-to-end infrastructure that serves Utilidata's AI and ML models across edge deployments, cloud environments, and data center integrations. They are also responsible for designing, building, and owning the integration of power data with AI inference software.  This is Utilidata's first dedicated role of this kind, and will serve as the foundational function for how the company deploys and operates AI capabilities in production. The role requires deep technical expertise in ML model serving, distributed systems, and GPU infrastructure, with a strong emphasis on reliability, performance, and scalability. This position works cross-functionally with product, engineering, and data science teams and is open to fully remote candidates, with periodic travel expected for company retreats and key on-site engagements.
Responsibilities
  • Lead the design and build of Utilidata's AI inference platform — establishing architecture patterns, deployment standards, and operational practices that will scale with the company
  • Own end-to-end model serving infrastructure for Utilidata's AI infrastructure (on-prem and datacenter) 
  • Build and maintain fault-tolerant, high-performance systems for serving AI models at scale, with a focus on low latency, reliability, and cost efficiency
  • Collaborate closely with algorithms engineers to integrate AI inference data and configuration with power optimization algorithms 
  • Optimize GPU utilization and inference performance across our hardware fleet, including NVIDIA accelerators central to Utilidata's edge AI platform
  • Establish MLOps best practices including CI/CD pipelines for model deployment, monitoring, and rollback across environments
  • Contribute to infrastructure roadmap decisions, including build vs. buy tradeoffs, tooling selection, and platform evolution as the team grows

Minimum Qualifications 
  • 5+ years of software engineering experience with a strong focus on AI infrastructure, backend systems, or distributed systems
  • Hands-on experience with AI model serving frameworks (e.g., vLLM, SGLang, Triton, TensorRT, TorchServe, or similar)
  • Understanding of container orchestration and cluster management (Kubernetes, Docker)
  • Experience deploying and operating infrastructure across both datacenter and on-prem environments
  • Strong knowledge of GPU workloads and the tradeoffs that come with them — you understand how inference differs from training, and why it matters
  • Proficiency in Python; C++, CUDA, Go, Rust a plus
  • Excellent communication skills and comfort working cross-functionally in a lean, fast-moving environment
  • Willingness to travel up to 10% of time 

Enhanced Qualifications (Nice to Have) 
  • Dynamo experience a plus
  • Experience with edge AI deployments or constrained compute environments
  • Familiarity with infrastructure as code (Terraform, Helm)
  • Experience with observability platforms (Datadog, Prometheus, Grafana)
  • Background in energy, utilities, or industrial IoT
  • Contributions to open-source ML infrastructure projects

Salary Range: $170,000 to $210,000 base compensation depending on experience plus stock options. Salary will be commensurate with an individual's skills, training, years of experience, and in line with internal compensation bands.
Location: This position can be performed remotely from anywhere in the United States. 
Our Commitments:
Utilidata values the diversity of our team. We provide equal employment opportunities without regard to race, color, religion, creed, sex, gender, sexual orientation, gender identity or expression, national origin, age, physical disability, mental disability, medical condition, pregnancy or childbirth, sexual orientation, genetics, genetic information, marital status, or status as a covered veteran or any other basis protected by applicable federal, state and local laws.
We are committed to:
  • Creating a diverse and inclusive workplace that is welcoming, supportive, affirming and respectful
  • Empowering employees to solve problems and work together to make a difference
  • Providing mentorship and growth opportunities as part of a collaborative team
  • A flexible work environment with flexible paid time off
  • Competitive compensation and benefits, including health, dental, vision, and employer-match 401k

 

Top Skills

Ai Infrastructure
C++
Cuda
Datadog
Docker
Go
Grafana
Helm
Kubernetes
Ml Model Serving Frameworks
Prometheus
Python
Rust
Terraform
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Providence, RI
77 Employees
Year Founded: 2012

What We Do

Utilidata is an AI-powered technology company that is working with NVIDIA to create the next generation of AI-embedded infrastructure, starting with the electric grid. Karman, our distributed AI platform, operates on our custom NVIDIA module, makes data available for accelerated computing at the edge, and trains AI models locally. Karman is embedded in grid devices - starting with smart meters - to transform the way utility companies operate. As the electric grid becomes more complex with the rapid increase of electric vehicles, distributed solar, batteries, heat pumps and extreme weather, utilities need real-time visibility of grid conditions and dynamic, software-defined infrastructure. Karman provides real-time visibility and AI at the grid edge so utilities can better utilize customer energy resources, reduce power outages, and enable quicker storm recovery. We are a mission-driven, collaborative, and adaptive team working to do what’s right, even when it’s hard. With backgrounds in electric engineering, power systems engineering, software engineering, data science, and energy policy, we bring a unique perspective on the solutions the energy industry needs. We are committed to ensuring a diverse, inclusive, and flexible workplace where employees are provided mentorship and growth opportunities and are empowered to solve problems as part of a collaborative team.

Similar Jobs

Andromeda (andromeda.ai) Logo Andromeda (andromeda.ai)

Senior Site Reliability Engineer

Artificial Intelligence • Cloud • Information Technology • Software
In-Office or Remote
San Francisco, CA, USA
17 Employees

Andromeda (andromeda.ai) Logo Andromeda (andromeda.ai)

Site Reliability Engineer

Artificial Intelligence • Cloud • Information Technology • Software
In-Office or Remote
San Francisco, CA, USA
17 Employees

Andromeda (andromeda.ai) Logo Andromeda (andromeda.ai)

Software Engineer

Artificial Intelligence • Cloud • Information Technology • Software
In-Office or Remote
San Francisco, CA, USA
17 Employees

Deepgram Logo Deepgram

Site Reliability Engineer

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Remote
USA
150 Employees
150K-220K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account