Niantic Spatial, Inc.

Computer Vision Researcher (VLM)

Reposted 2 Days Ago

Be an Early Applicant

London, Greater London, England, GBR

Hybrid

Senior level

Artificial Intelligence • Computer Vision • Machine Learning • Software

The Role

Develop AI systems integrating 3D computer vision and language understanding for context-aware navigation and complex interaction in physical environments.

Summary Generated by Built In

At Niantic Spatial, we’re building the future of geospatial AI. Powered by a proprietary database of over 30 billion posed images and a groundbreaking third-generation digital map, our mission is to develop spatial intelligence that helps both humans and machines better understand, navigate, and engage with the physical world. Our high-fidelity mapping technology unlocks a new dimension of interaction—laying the foundation for AI to truly comprehend and operate within real-world environments. Join us as we build a living model of the world that people and machines can talk to.

As a Computer Vision Researcher with experience in Large Language Models (LLMs), you will bridge the gap between 3D computer vision LLMs, creating a unified framework where machines can reason about their surroundings. By linking spatial geometry directly to language, you will enable our systems to perform context-aware navigation and answer complex, open-ended questions about the physical world.

Responsibilities

Architect Semantic Grounding: Lead research into cross-modal grounding that connects 3D spatial features with language embeddings, enabling the LGM to "understand" object relationships and environmental context.
Scale "Understand" Capabilities: Develop and deploy algorithms for continuous semantics, allowing our 3D maps to evolve and improve their situational awareness as new ground-level and aerial data is ingested.
Agentic Frameworks: Build the "spatial brain" for Embodied AI, enabling robots, Drones and other Machines to move beyond simple navigation to mission-level reasoning.
Multimodal Benchmarking: Define the standards for measuring "spatial common sense" in LLMs, creating evaluations that test a model’s ability to interpret and operate within complex 3D scenes.
Technical Mentorship: Serve as the technical anchor for the London R&D hub, resolving architectural disagreements and mentoring the next generation of researchers in the fusion of 3D CV and NLP.
Collaborative Innovation: Partner with Product leads to ensure the "Understand" API delivers high business value for enterprise customers in robotics, logistics, and field operations.

Required Qualifications:

Education: PhD (or equivalent) in Computer Vision, Machine Learning, or Robotics with a focus on Multimodal/Semantic understanding.
Years of Experience: 4+ years of experience in ML research, with a proven track record of shipping models that bridge 3D Vision and Language.
Technical Depth: Expert knowledge of 3D Geometry (SfM, SLAM, VPS) and Transformer-based architectures (VLMs).
Research Impact: Multiple first-author publications at top-tier venues (CVPR, NeurIPS, ICLR) focusing on VLMs, scene understanding or semantic segmentation.
Implementation Mastery: Ability to write production-quality research code in PyTorch or JAX and manage large-scale data pipelines.
Required In-Office Days: 3 days per week

Plus If:

Experience with Gaussian Splatting or NeRFs for semantic scene representation.
Background in robotics (ROS) or building agentic systems that interact with physical environments.
Experience with "open-set" recognition and Zero-Shot learning.

Candidate Privacy Policy

I understand that by submitting my job application, the information I provide as part of that application will be used in accordance with Niantic Spatial’s Privacy Notice for Job Applicants and Candidates.

If required by law, by submitting my job application I consent to the processing of my information as described in that Notice, including processing information I voluntarily disclose to Niantic Spatial, such as health or medical information, race or ethnicity data, and sexual orientation data and, in limited circumstances sharing information with third parties such as references and other third parties that assist in the hiring process.

Niantic Spatial is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with reasonable accommodation during the application process, please contact your recruiter.

Skills Required

PhD in Computer Vision, Machine Learning, or Robotics
4+ years of experience in ML research
Expert knowledge of 3D geometry and transformer-based architectures
Multiple first-author publications at top-tier venues
Ability to write production-quality research code in PyTorch or JAX
Experience in Gaussian Splatting or NeRFs is a plus
Background in robotics or building agentic systems is a plus
Experience with open-set recognition and Zero-Shot learning is a plus

View all jobs at Niantic Spatial, Inc.

View Niantic Spatial, Inc. Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

182 Employees

Year Founded: 2025

What We Do

Niantic Spatial is building a living model of the world for machines, developing a geospatial AI model to understand and digitally map the physical world through spatial foundation and large geospatial models.