Research Scientist / Engineer - Efficient Modeling

Posted Yesterday
Be an Early Applicant
Mountain View, CA, USA
In-Office
Mid level
Artificial Intelligence • Computer Vision • Hardware • Robotics
The Role
Research and implement model-compression and efficiency techniques (quantization, pruning, distillation, low-rank approximations), design efficient architectures and attention mechanisms, develop training strategies, profile and benchmark models on hardware, build evaluation frameworks, and collaborate with training and deployment teams to enable real-time inference on robot hardware. Publish and present results.
Summary Generated by Built In

At Rhoda AI, we’re building the next generation of generalist intelligent robots. We own the full robotics stack from high-performance hardware and robot systems to the infrastructure and state-of-the-art foundation world models that control our robots. Our robots are designed to be generalists capable of operating in complex, real-world environments and handling long-tail edge cases, made possible by our cutting edge research and end-to-end system design. We've raised over $400M and are investing aggressively in model research, infrastructure, hardware development, and manufacturing scale-up to make generalist robotics a reality.

We're looking for a Research Scientist or Research Engineer focused on model efficiency — making our foundation world models faster, smaller, and more deployable without sacrificing capability. This work is critical to closing the gap between research-scale models and real-time operation on robot hardware.

What You'll Do

  • Research and implement model compression techniques: quantization, pruning, structured sparsity, distillation, and low-rank approximation

  • Design efficient architectures and attention mechanisms suited to real-time inference on edge and robot hardware

  • Develop training strategies that produce better accuracy-efficiency tradeoffs from the start

  • Profile and benchmark models across hardware targets to identify and resolve efficiency bottlenecks

  • Build evaluation frameworks that measure capability retention after compression or architecture changes

  • Collaborate with training systems and deployment teams to ensure efficient models translate to faster real-world inference

  • Publish and present work at top-tier venues

What We're Looking For

  • Strong understanding of model compression and efficient architectures for large models

  • Hands-on experience with quantization, distillation, or pruning applied to transformers or large neural networks

  • Deep knowledge of where efficiency gains are possible in modern architectures

  • Proficiency with PyTorch and familiarity with hardware-aware optimization (CUDA, TensorRT, or similar)

  • Ability to run principled experiments that characterize capability-efficiency tradeoffs

Nice to Have (But Not Required)

  • PhD in ML, CS, or a related field — or equivalent research/engineering experience

  • Publication record at NeurIPS, ICML, ICLR, MLSys, or related venues

  • Experience with efficient video or multimodal model architectures

  • Familiarity with edge deployment targets (Jetson, custom ASICs, or mobile hardware)

  • Prior work on speculative decoding, early exit, or adaptive compute

  • Experience deploying compressed models on physical robots or latency-constrained systems

Why This Role

  • Bridge the gap between large-scale research models and real-time robot deployments

  • Your work determines whether frontier capabilities actually run on our hardware

  • High leverage: efficiency improvements benefit every model the team trains and deploys

  • Work at a rare intersection of deep learning research and systems

Skills Required

  • Strong understanding of model compression and efficient architectures for large models
  • Hands-on experience with quantization, distillation, or pruning applied to transformers or large neural networks
  • Proficiency with PyTorch
  • Familiarity with hardware-aware optimization (CUDA, TensorRT, or similar)
  • Ability to run principled experiments characterizing capability-efficiency tradeoffs
  • PhD in ML, CS, or related field or equivalent research/engineering experience
  • Publication record at NeurIPS, ICML, ICLR, MLSys, or related venues
  • Experience with efficient video or multimodal model architectures
  • Familiarity with edge deployment targets (Jetson, custom ASICs, or mobile hardware)
  • Prior work on speculative decoding, early exit, or adaptive compute
  • Experience deploying compressed models on physical robots or latency-constrained systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
73 Employees
Year Founded: 2024

What We Do

Rhoda AI builds robot foundation models that learn from internet-scale video to enable manipulation-capable robots to generalize in real-world industrial environments. Using a Direct Video Action architecture and its FutureVision intelligence layer, Rhoda focuses on turnkey deployments in manufacturing, logistics, and e-commerce—aiming to move robots out of controlled labs and into reliable, adaptive production settings.

Similar Jobs

Wipfli Logo Wipfli

Transaction Advisory Services Manager

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
United States
3000 Employees
117K-158K Annually

Wipfli Logo Wipfli

Director - Transaction Advisory Services

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
United States
3000 Employees
142K-191K Annually

CrowdStrike Logo CrowdStrike

Infrastructure Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
140K-215K Annually

CrowdStrike Logo CrowdStrike

Senior Software Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Hybrid
Sunnyvale, CA, USA
10000 Employees
140K-215K Annually

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account