We are seeking an exceptional Research Intern to join our core team in building the next generation of interactive Video World Models. While traditional generative AI focuses on generating passive pixels (e.g., text-to-video), our mission is fundamentally more ambitious: we are building foundational "World Models" that inherently understand physics, causality, action spaces, and complex dynamics directly from internet-scale data. Our goal is to train models that can simulate and "dream" complex virtual worlds, allowing users and agents to explore and interact with them in real time.
This is not a purely theoretical role.
Training interactive world models at this scale requires pushing the absolute limits of modern GPUs. We operate at the intersection of cutting-edge generative AI research and high-performance machine learning systems. We are looking for "full-stack" hacker-researchers: visionary thinkers who are also elite engineers, capable of co-designing novel neural architectures and engineering the highly optimized infrastructure required to train them across thousands of GPUs.
What You Will Do
- Architect & Scale Foundation Models: Design, train, and scale state-of-the-art interactive world models (combining Diffusion, Autoregressive Transformers, VAEs, LLMs, VLMs) on massive video datasets.
- Push the Boundaries of ML Systems: Architect highly scalable distributed training pipelines, utilizing advanced model and data parallelism to train massive models efficiently on large-scale GPU clusters.
- Optimize for Efficiency: Profile and optimize model architectures to break through memory and compute bottlenecks. Write high-performance, custom hardware kernels to maximize Model FLOPs Utilization (MFU) and enable real-time, low-latency inference.
What We Are Looking For
- Academic Excellence: Currently pursuing a PhD (or Master’s degree with a truly exceptional research/engineering track record) in Computer Science, Machine Learning, Computer Architecture, or a related field.
- Engineering Skills: Exceptional, production-level coding proficiency in Python or other languages. Background in competitive programming is a great plus.
- AI Infrastructure & Scaling: Experience with the modern AI infrastructure stack and large-scale machine learning systems, such as PyTorch FSDP and Megatron. Experience writing GPU kernels in CUDA and/or Triton is a great plus.
- Deep Generative Expertise: Thorough theoretical and practical understanding of modern generative paradigms (Diffusion, Vision Transformers, Autoregressive sequence modeling, discrete tokenization/VAEs).
- Top-Tier Publication Record: First-author publications in top-tier AI venues (NeurIPS, ICLR, ICML, CVPR, ICCV) OR premier ML Systems venues (MLSys, OSDI, ASPLOS).
Location State(s)
US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
What We Do
Tencent uses technology to enrich the lives of Internet users. Our communications and social platforms Weixin and QQ connect users with each other, with digital content, and with daily life services in just a few clicks. Our high-performance advertising platform helps brands and marketers reach hundreds of millions of consumers in China. Our financial technology and business services support our partners' business growth and assist their digital upgrade. We invest heavily in talent and technological innovation, actively participating in the development of the Internet industry. Tencent was founded in Shenzhen, China, in 1998 and has been listed on the Main Board of the Stock Exchange of Hong Kong since June 2004.