Computer Vision Researcher (VLM)

Reposted 6 Days Ago
Be an Early Applicant
London, Greater London, England, GBR
Hybrid
Senior level
Artificial Intelligence • Computer Vision • Machine Learning • Software
The Role
Develop AI systems integrating 3D computer vision and language understanding for context-aware navigation and complex interaction in physical environments.
Summary Generated by Built In

At Niantic Spatial, we’re building the future of geospatial AI. Powered by a proprietary database of over 30 billion posed images and a groundbreaking third-generation digital map, our mission is to develop spatial intelligence that helps both humans and machines better understand, navigate, and engage with the physical world. Our high-fidelity mapping technology unlocks a new dimension of interaction—laying the foundation for AI to truly comprehend and operate within real-world environments. Join us as we build a living model of the world that people and machines can talk to.

As a Computer Vision Researcher with experience in Large Language Models (LLMs), you will bridge the gap between 3D computer vision LLMs, creating a unified framework where machines can reason about their surroundings. By linking spatial geometry directly to language, you will enable our systems to perform context-aware navigation and answer complex, open-ended questions about the physical world.

Responsibilities
  • Architect Semantic Grounding: Lead research into cross-modal grounding that connects 3D spatial features with language embeddings, enabling the LGM to "understand" object relationships and environmental context.

  • Scale "Understand" Capabilities: Develop and deploy algorithms for continuous semantics, allowing our 3D maps to evolve and improve their situational awareness as new ground-level and aerial data is ingested.

  • Agentic Frameworks: Build the "spatial brain" for Embodied AI, enabling robots, Drones and other Machines to move beyond simple navigation to mission-level reasoning.

  • Multimodal Benchmarking: Define the standards for measuring "spatial common sense" in LLMs, creating evaluations that test a model’s ability to interpret and operate within complex 3D scenes.

  • Technical Mentorship: Serve as the technical anchor for the London R&D hub, resolving architectural disagreements and mentoring the next generation of researchers in the fusion of 3D CV and NLP.

  • Collaborative Innovation: Partner with Product leads to ensure the "Understand" API delivers high business value for enterprise customers in robotics, logistics, and field operations.

Required Qualifications:

  • Education: PhD (or equivalent) in Computer Vision, Machine Learning, or Robotics with a focus on Multimodal/Semantic understanding.

  • Years of Experience: 4+ years of experience in ML research, with a proven track record of shipping models that bridge 3D Vision and Language.

  • Technical Depth: Expert knowledge of 3D Geometry (SfM, SLAM, VPS) and Transformer-based architectures (VLMs).

  • Research Impact: Multiple first-author publications at top-tier venues (CVPR, NeurIPS, ICLR) focusing on VLMs, scene understanding or semantic segmentation.

  • Implementation Mastery: Ability to write production-quality research code in PyTorch or JAX and manage large-scale data pipelines.

  • Required In-Office Days: 3 days per week

Plus If:

  • Experience with Gaussian Splatting or NeRFs for semantic scene representation.

  • Background in robotics (ROS) or building agentic systems that interact with physical environments.

  • Experience with "open-set" recognition and Zero-Shot learning.

Candidate Privacy Policy

I understand that by submitting my job application, the information I provide as part of that application will be used in accordance with Niantic Spatial’s Privacy Notice for Job Applicants and Candidates.

If required by law, by submitting my job application I consent to the processing of my information as described in that Notice, including processing information I voluntarily disclose to Niantic Spatial, such as health or medical information, race or ethnicity data, and sexual orientation data and, in limited circumstances sharing information with third parties such as references and other third parties that assist in the hiring process.

Niantic Spatial is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with reasonable accommodation during the application process, please contact your recruiter.

Skills Required

  • PhD in Computer Vision, Machine Learning, or Robotics
  • 4+ years of experience in ML research
  • Expert knowledge of 3D geometry and transformer-based architectures
  • Multiple first-author publications at top-tier venues
  • Ability to write production-quality research code in PyTorch or JAX
  • Experience in Gaussian Splatting or NeRFs is a plus
  • Background in robotics or building agentic systems is a plus
  • Experience with open-set recognition and Zero-Shot learning is a plus
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
182 Employees
Year Founded: 2025

What We Do

Niantic Spatial is building a living model of the world for machines, developing a geospatial AI model to understand and digitally map the physical world through spatial foundation and large geospatial models.

Similar Jobs

Ericsson Logo Ericsson

Architect

Cloud • Information Technology • Internet of Things • Machine Learning • Software • Cybersecurity • Infrastructure as a Service (IaaS)
In-Office
Reading, Berkshire, England, GBR
88000 Employees

Morningstar Logo Morningstar

Sales Director

Artificial Intelligence • Big Data • Enterprise Web • Fintech • Software • Financial Services
Hybrid
London, Greater London, England, GBR
11500 Employees
79K-115K Annually

Teya Logo Teya

Senior Security Engineer

Fintech • Payments • Financial Services
Hybrid
3 Locations
1000 Employees

Wells Fargo Logo Wells Fargo

Executive Assistant

Fintech • Financial Services
Hybrid
City of London, City and County of the City of London, England, GBR
205000 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account