Research Engineer - CUDA Kernel Engineering

Reposted 24 Days Ago
Be an Early Applicant
Hiring Remotely in Office, Machaze, Manica, MOZ
Remote
Mid level
Artificial Intelligence • Information Technology
The Role
Develop and optimize CUDA kernels for AI models to enhance semiconductor design and verification, ensuring efficient GPU utilization for training and inference.
Summary Generated by Built In

About Voltai
Voltai is developing world models, and agents to learn, evaluate, plan, experiment, and interact with the physical world. We are starting out with understanding and building hardware; electronics systems and semiconductors where AI can design and create beyond human cognitive limits.

About the Team

Backed by Silicon Valley’s top investors, Stanford University, and CEOs/Presidents of Google, AMD, Broadcom, Marvell, etc. We are a team of previous Stanford professors, SAIL researchers, Olympiad medalists (IPhO, IOI, etc.), CTOs of Synopsys & GlobalFoundries, Head of Sales & CRO of Cadence, former US Secretary of Defense, National Security Advisor, and Senior Foreign-Policy Advisor to four US presidents.

About the Role

You will develop, integrate, and optimize state-of-the-art CUDA kernels to power AI models that accelerate semiconductor design and verification. Your work will enable large-scale model training, inference, and reinforcement learning systems that reason about circuit layouts, generate and validate RTL, and optimize chip architectures — running efficiently across thousands of GPUs.
You’ll build tools, performance benchmarks, and integration layers that push the limits of GPU utilization for compute-intensive workloads in AI-driven hardware design. Working closely with researchers and engineers, you’ll help make Voltai the world’s leading AI + semiconductor research organization. You’ll also release your kernels and tooling as contributions to the open-source AI and HPC ecosystems.

You might thrive in this role if you have experience with

  • Writing and optimizing CUDA kernels for large-scale AI workloads (attention, routing, graph-based operations, physics-inspired operators, etc.)

  • Profiling and optimizing GPU performance for custom compute or memory-bound workloads

  • Integrating custom kernels into cutting-edge training and inference frameworks (e.g., PyTorch, Megatron, vLLM, TorchTitan)

  • Working with the latest NVIDIA hardware and software stacks (Hopper, Blackwell, NVLink, NCCL, Triton)

  • Building GPU-accelerated primitives for graph reasoning, symbolic computation, or hardware simulation tasks

  • Collaborating with AI researchers and semiconductor experts to translate domain-specific workloads into high-performance GPU code

Skills Required

  • Experience in writing and optimizing CUDA kernels for large-scale AI workloads
  • Proficient in profiling and optimizing GPU performance
  • Experience integrating custom kernels into training and inference frameworks
  • Familiarity with NVIDIA hardware and software stacks
  • Ability to build GPU-accelerated primitives for computation tasks
  • Collaboration skills with AI researchers and semiconductor experts
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
4 Employees

What We Do

AI models for electronics

Similar Jobs

Centari Logo Centari

Senior Software Engineer

Artificial Intelligence • Legal Tech • Professional Services • Software
Remote or Hybrid
Office, Machaze, Manica, MOZ
8 Employees
150K-200K Annually

Clearwater Analytics (CWAN) Logo Clearwater Analytics (CWAN)

Employment Law Attorney

Fintech • Software • Financial Services
Remote or Hybrid
2 Locations
1100 Employees
100K-168K Annually

Mondelēz International Logo Mondelēz International

European Director, Nutrition & Scientific Affairs

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
7 Locations
90000 Employees

Mondelēz International Logo Mondelēz International

Consumer Data Platforms Product Lead

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
3 Locations
90000 Employees

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account