This role focuses on large-scale world models for temporal reasoning and generation, including video models, multimodal generative models, LLM/VLM/VLA models, and predictive models of traffic participants and scenes. Your work will directly power Waabi World’s ability to model future evolution, synthesize realistic safety-critical scenarios, and provide rich generative priors for downstream planning, testing, and training.
You will…
- Conduct fundamental and applied research in generative and predictive world-modeling
• Video generation and prediction.
• Latent diffusion / autoregressive / flow-matching models.
• Multimodal foundation models for driving scenes.
• LLM / VLM / VLA methods for scene understanding, reasoning, and control.
• Generative scenario modeling and controllable simulation.
• Model distillation.
- Collaborate with engineers to integrate models into large-scale, distributed training and rendering pipelines.
- Publish high-impact research at top conferences (CVPR, ECCV, ICCV, NeurIPS, ICLR, ICRA, SIGGRAPH).
- Mentor junior scientists and interns; foster a culture of scientific rigor and rapid experimentation.
- Stay on top of emerging advances in generative AI, differentiable rendering, knowledge distillation/compression, and robotics.
Qualifications:
- Demonstrated technical innovation: You have a Ph.D. in Computer Vision, Machine Learning, Robotics, or a related field or equivalent research experience pushing the boundaries of a technical field..
- Strong prototyping and implementation: You have expert-level Python & PyTorch (or JAX) skills; strong software-engineering fundamentals and experience with distributed training.
- Expert domain knowledge: You have built generative or predictive models of the physical world with scale and efficiency in mind for real-world applications
- Team player: You have worked in a close-knit team of researchers and engineers and have strong communication to deliver successful projects.
Bonus:
- Proven ability to translate research into production-quality code and measurable product impact.
- Demonstrated publications (first-author) in top-tier venues on topics such as world models, generative simulation, video prediction, diffusion, flow-matching, or foundation models for autonomy.
Skills Required
- Ph.D. in Computer Vision, Machine Learning, Robotics, or related field
- Expert-level Python & PyTorch (or JAX) skills
- Experience with distributed training
- Built generative or predictive models of the physical world
What We Do
Waabi, founded by AI pioneer and visionary Raquel Urtasun, is an AI company building the next generation of self-driving technology. With a world class team and an innovative approach that unleashes the power of AI to “drive” safely in the real world, Waabi is bringing the promise of self-driving closer to commercialization than ever before. Waabi is backed by best-in-class investors across the technology, logistics and the Canadian innovation ecosystem, including Khosla Ventures, Uber, 8VC, Radical Ventures, OMERS Ventures and BDC Capital’s Women in Technology Venture Fund. To learn more visit: waabi.ai Press: [email protected] Business: [email protected]









