PhD Research Internship – Robotics Engineer (VLM / VLA Models)

Reposted 8 Days Ago
2 Locations
In-Office
Internship
Artificial Intelligence • Robotics • Industrial • Automation
The Role
The role involves researching and developing Vision-Language-Action systems in robotics, including model development, data analysis, and deployment.
Summary Generated by Built In

sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training.

We integrate cutting-edge robotics into a platform that powers intelligence and automation products, transforming productivity and safety for customers in mining, construction, and adjacent industries today.

Join us and play a pivotal role in transforming the automation landscape in heavy industries.

Role Overview


We are seeking a highly motivated PhD candidate to join our team as a Research Intern specializing in General Purpose AI, with a focus on Vision-Language Models and Vision-Language-Action systems. This role sits at the frontier of industrial robotics: developing scalable, general-purpose VLA systems that enable robots to perceive, reason, and act autonomously in complex heavy-industry environments. You will contribute to bridging multi-modal perception (e.g., video, radar, lidar) with robust real-world execution, while advancing state-of-the-art methods in embodied AI.

Beyond engineering, this position has a strong research component, with opportunities to contribute to novel methods, publish findings, and shape the future of industrial autonomy.


Key Responsibilities


Depending on your expertise and project priorities, you will:


Research & Method Development

  • Design and develop novel approaches for Vision-Language-Action systems in real-world industrial settings

  • Explore scalable architectures for multi-modal reasoning and action generation

  • Contribute to advancing state-of-the-art methods in embodied AI and robotic autonomy


Multi-Modal Learning & Data Systems

  • Lead the design and analysis of large-scale multi-modal datasets (video, radar, lidar, sensor fusion)

  • Develop self-supervised or weakly supervised dataset generation pipelines for VLA training

  • Investigate data-centric approaches to improve robustness and generalization


Model Development & Optimization

  • Build, adapt, and extend cutting-edge GenAI models (e.g., VLMs, VLA frameworks)

  • Apply advanced fine-tuning strategies (e.g., parameter-efficient tuning, alignment methods)

  • Explore prompt optimization, reasoning augmentation, and action grounding techniques


Training, Evaluation & Benchmarking

  • Design rigorous evaluation protocols for embodied AI systems in industrial contexts

  • Run large-scale experiments, analyze performance, and iterate systematically

  • Benchmark models against state-of-the-art approaches and internal baselines


Deployment & Systems Integration

  • Collaborate with engineering teams to transition research prototypes into production-ready systems

  • Optimize models for real-time inference, robustness, and safety in heavy-industry environments


Scientific Contribution

  • Document findings and contribute to research publications, technical reports, or patents

  • Present results internally and potentially at leading conferences


Required Qualifications
  • Current enrollment in a PhD program in Robotics, Computer Science, Machine Learning, Electrical Engineering, or a related field

  • Strong programming skills in Python and deep learning frameworks (e.g., PyTorch)

  • Solid understanding of machine learning, deep learning, and multi-modal models

  • Proven ability to conduct independent research and drive projects from idea to results

  • Strong analytical thinking and problem-solving skills


Preferred Skills & Experience
  • Experience with Vision-Language Models, embodied AI, or robotics learning systems

  • Familiarity with modern GenAI tooling (e.g., Hugging Face ecosystem, Gemini, Unsloth, or similar)

  • Experience with multi-modal data (vision + sensor fusion)

  • Background in robotics, control systems, or real-world deployment

  • Track record of research output (publications, preprints, or significant research projects)

  • Experience with large-scale training, distributed systems, or model optimization


Research Environment & Outlook
  • Opportunity to work on high-impact, real-world robotics problems at the intersection of AI and industrial automation

  • Collaboration with a multidisciplinary team spanning AI research and robotics engineering

  • Potential to publish and contribute to the scientific community

  • Opportunity to shape long-term research directions and transition work into real-world deployment

What We Offer:
  • Build physical AI for the world's largest off-highway machinery, making it intelligent, safe, and ready for every tough task

  • Join the pioneer in intelligent robotics backed by Point Nine & other Tier 1 investors

  • Combine cutting-edge robotics research in end-to-end learning and Vision-Language-Action models with real-world heavy mobile equipment

  • Tailor your own career path, whether you would like to become a technical specialist or a technical team lead

  • Experience a great team culture, beverages, and an amazing office environment


Benefits:
  • Attractive compensation package and stock options.

  • Beverages on-site and regular social events.

  • Engage with top-tier researchers, engineers, and thought leaders.

  • Influence the future of robotic technologies and tackle significant technological challenges.

  • Assistance with relocation to Berlin.


About Us:

Heavy machinery, light years ahead.

We are proudly backed by Point Nine and other Tier 1 investors.


The Company
50 Employees
Year Founded: 2022

What We Do

Sensmore automates the world's largest machines with unprecedented intelligence using physical AI and robotics. They integrate cutting-edge technology to transform productivity and safety in industries like mining and construction.
