Principal Engineer - GPU and LLM Infrastructure

Posted 9 Days Ago
3 Locations
Hybrid
Senior level
Fintech • Financial Services
Wells Fargo: Tech-powered. Innovation-led. We're transforming financial services.
The Role
About this role:
Wells Fargo is seeking a Principal Engineer - GPU & LLM Infrastructure to lead the end-to-end strategy and operations of our enterprise GPU platforms within Digital Technology's AI Capability Engineering group. In this role, you will design and evolve GPU architecture across on-premises and cloud environments, guide POCs through production readiness, and oversee Day-2 operations for large-scale, multi-cloud deployments.
You will serve as the technical authority for NVIDIA/Run:AI orchestration, drive alignment with OpenShift AI, and enable high-performance LLM/SLM inferencing using Triton and vLLM. A core part of the role is ensuring our GenAI platforms are secure, resilient, scalable, and fully observable to meet the demands of enterprise-grade AI workloads.
In this role, you will:
  • Act as an advisor to leadership to develop or influence GPU buildout for highly complex business and technical needs across multiple groups
  • Lead the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas or the enterprise, delivering solutions that are long-term and large-scale and that require vision, creativity, innovation, and advanced analytical and inductive thinking
  • Translate advanced technology experience, in-depth knowledge of the organization's tactical and strategic business objectives, the enterprise technological environment, the organizational structure, and strategic technological opportunities and requirements into technical engineering solutions
  • Provide vision, direction and expertise to leadership on implementing innovative and significant business solutions
  • Maintain knowledge of industry best practices and new technologies and recommend innovations that enhance operations or provide a competitive advantage to the organization
  • Strategically engage with all levels of professionals and managers across the enterprise and serve as an expert advisor to leadership
  • Design and implement GPU cluster topologies (H100/H200, NVLink/NVSwitch), networking, and storage paths for high-throughput inferencing; publish sizing and performance baselines

Required Qualifications:
  • 7+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

Desired Qualifications:
  • 1+ years of experience with NVIDIA GPU and CUDA ecosystems, including CUDA, cuDNN, NVLink/NVSwitch, MIG, NCCL, GPU profilers, and performance tuning for H100/H200 architectures
  • 1+ years of experience with LLM/SLM runtimes, such as vLLM, TensorRT-LLM, and Triton; hands-on work with model quantization (FP8, INT4 AWQ/GPTQ), KV-cache optimization strategies, and disaggregated prefill/decode pipelines
  • 1+ years of experience in orchestration and GPU workload management, including GPU resource managers (collections/departments/projects/workloads), OCP/GKE administration, quota management, preemption and fair-share enforcement, GPU scheduling and timeslicing, Helm/Kustomize, upgrade validation, and admission controls
  • 1+ years of experience with API and gateway platforms, including Apigee authentication/authorization, quota and rate-limit configuration, OpenAPI specifications, SDK generation, SLA operations, and API versioning/deprecation workflows
  • 1+ years of experience in observability and evaluation tooling, including Arize-like systems for tracing and evaluations, SLO development, alerting design, retention/export workflows, and dashboard creation
  • 1+ years of experience in performance engineering, including throughput and latency modeling (tokens/sec, batch shaping, cache policies) and cost/performance optimization strategies for LLM/SLM workloads

Job Expectations:
  • Hybrid onsite at required locations
  • No visa sponsorship available.
  • No relocation assistance for this position.

Top Skills

APIs
CUDA
Docker
GPU
Kubernetes
MIG
NCCL
NVLink
NVSwitch
OpenShift AI
TensorRT-LLM
Triton
vLLM

The Company
HQ: San Francisco, CA
205,000 Employees
Year Founded: 1852

What We Do

Wells Fargo & Company (NYSE: WFC) is a leading financial services company that has approximately $2.1 trillion in assets. We provide a diversified set of banking, investment and mortgage products and services, as well as consumer and commercial finance, through our four reportable operating segments: Consumer Banking and Lending, Commercial Banking, Corporate and Investment Banking, and Wealth & Investment Management. Wells Fargo ranked No. 33 on Fortune’s 2025 rankings of America’s largest corporations. Our technology professionals drive innovation, information security, and big data analytics while maintaining a network that handles more than 12 billion customer interactions a year. Join us!

At Wells Fargo, we're more than a financial services leader – we’re a global trailblazer committed to driving innovation, empowering communities, and helping our customers succeed. We believe that a meaningful career is much more than just a job – it’s about finding all of the elements to help you thrive, in one place. Living the Well Life means you’re supported in life, not just work. It means having robust benefits, competitive compensation, and programs designed to help you find work-life balance and well-being. You’ll be rewarded for investing in your community, celebrated for being your authentic self, and empowered to grow. And we’re recognized for it – Wells Fargo once again ranked in the top three – making us the #1 financial services employer – on the 2025 LinkedIn Top Companies list of best workplaces “to grow your career” in the U.S.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

© 2026 Wells Fargo Bank, N.A. All rights reserved. Member FDIC.

Why Work With Us

We're known for our “Well Life” approach to supporting employees’ career aspirations, work-life balance, and mental and physical health. We ranked in the top 3 on the 2025 LinkedIn Top Companies list – and #1 among financial services companies – as the best workplace “to grow your career” in the U.S.

Wells Fargo Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: 3 days a week
HQ: San Francisco, CA
Bangalore, Bangalore
Belfast, GB
Bengaluru, Karnataka
Chandler, AZ
Charlotte, NC
Technology Center
Hyderabad, Telangana
Irving, TX
New York, NY
New York, NY
Phoenix, AZ