Hunyuan Multimodal Reinforcement Learning Research Intern

Reposted 25 Days Ago
Be an Early Applicant
Palo Alto, CA, USA
In-Office
80K-125K Annually
Internship
Gaming • Software • Metaverse
The Role
Conduct research and develop algorithms for multimodal reinforcement learning, focusing on designing RL infrastructure and reward modeling strategies.
Summary Generated by Built In
Business UnitWhat the Role EntailsResponsibilities:

1. Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.

2. Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.

3. Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.

Who We Look ForRequirements:

1. Currently enrolled as a PhD student in Computer Science or a closely related field.

2. Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH.

3. Strong hands-on programming skills, with solid experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference.

4. Prior experience with diffusion models, autoregressive models, and/or text-to-image or text-to-video generation is highly preferred.

5. Participation in ACM/NOIP is a strong plus.

Location State(s)

US-California-Palo Alto

The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Top Skills

Autoregressive Models
Cpu Acceleration
Deep Learning
Diffusion Models
Gpu Acceleration
Multimodal Models
Reinforcement Learning
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alto, CA
107,879 Employees
Year Founded: 1998

What We Do

Tencent uses technology to enrich the lives of Internet users. Our communications and social platforms Weixin and QQ connect users with each other, with digital content and daily life services in just a few clicks. Our high performance advertising platform helps brands and marketers reach out to hundreds of millions of consumers in China. Our financial technology and business services support our partners' business growth and assist their digital upgrade. We invest heavily in talent and technological innovation, actively participating in the development of the Internet industry. Tencent was founded in Shenzhen, China, in 1998, and listed on the Main Board of the Stock Exchange of Hong Kong since June 2004.

Similar Jobs

CDW Logo CDW

Consultant

Information Technology
Remote or Hybrid
US
15100 Employees
67K-97K Annually

CDW Logo CDW

Architect

Information Technology
Remote or Hybrid
US
15100 Employees
94K-132K Annually

CDW Logo CDW

PMO Manager

Information Technology
Remote or Hybrid
US
15100 Employees
121K-170K Annually
Remote or Hybrid
US
15100 Employees
52K-95K Annually

Similar Companies Hiring

Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account