Rackspace Technology

AI Model Serving Specialist

Posted Yesterday

Hiring Remotely in United States

Remote

82K-141K Annually

Mid level

Cloud • Information Technology • Software

The Role

Enable and support enterprise customers in deploying and optimizing AI workloads and in integrating model-serving platforms for secure, efficient inference services.

Role Purpose

Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments. This role bridges AI engineering and platform operations, ensuring secure, scalable, and cost-efficient inference services.

Key Responsibilities

Model Deployment & Optimization

Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters.

Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs.

Platform Integration

Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy.

Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers.

API & Service Enablement

Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.

Support RAG and agentic workflows by connecting to vector databases and context stores.

Observability & FinOps

Configure telemetry for GPU utilization, request tracing, and error monitoring.

Collaborate with FinOps to enable usage metering and chargeback reporting.

Customer Engineering Support

Assist solution architects in onboarding customers and creating reference patterns for BFSI, healthcare, and other verticals.

Provide troubleshooting and performance benchmarking guidance.

Continuous Improvement

Stay current with emerging model-serving frameworks and GPU acceleration techniques.

Contribute to reusable Helm charts, operators, and automation scripts.

Required Skills & Experience

Hands-on experience with NVIDIA Triton, vLLM, or similar serving stacks.
Strong knowledge of Kubernetes, GPU scheduling, and CUDA/MIG.
Familiarity with VMware VCF9, NSX-T networking, and vSAN storage classes.
Proficiency in Python and containerization (Docker).
Understanding of observability stacks (Prometheus, Grafana) and FinOps principles.
Exposure to RAG architectures, vector DBs, and secure multi-tenant environments.
Excellent problem-solving and customer-facing communication skills.

Preferred Certifications

NVIDIA Certified Professional (AI/ML)
Kubernetes Administrator (CKA)
VMware VCF Specialist
Rackspace AI Foundations (internal)

KPIs

Model deployment success rate and SLA compliance.
Latency/throughput benchmarks per SKU.
Customer satisfaction (NPS) for AI services.
Efficiency in GPU utilization and cost optimization.

Physical Demands

General office environment: no special physical demands required.
May require long periods of sitting and viewing a computer monitor.
Schedule flexibility to include working weekends and/or evenings and holidays as required by the business for 24/7 operations.

Travel

As per business needs

Sponsorship

This role is not eligible for sponsorship.
Candidates must be legally authorized to work in the US for any employer.

"Remote postings are limited to candidates residing within the country specified in the posting location"

About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies across applications, data and security to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work year after year by Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

Top Skills

CUDA

Docker

Grafana

Kubernetes

NSX-T

NVIDIA Triton

Prometheus

Python

vLLM

VMware VCF9

vSAN

The Company

HQ: San Antonio, TX

7,509 Employees

Year Founded: 1998

What We Do

At Rackspace Technology, we accelerate the value of the cloud during every phase of digital transformation. By managing apps, data, security and multiple clouds, we are the best choice to help customers get to the cloud, innovate with new technologies and maximize their IT investments. As a recognized Gartner Magic Quadrant leader, we are uniquely positioned to close the gap between the complex reality of today and the promise of tomorrow. Passionate about customer success, we provide unbiased expertise, based on proven results, across all the leading technologies. And across every interaction worldwide, we deliver Fanatical Experience™, the best customer service experience in the industry. Rackspace has been honored by Fortune, Forbes, Glassdoor and others as one of the best places to work.