Sr. Cloud AI Infrastructure Engineer

Reposted 25 Days Ago
Palo Alto, CA, USA
In-Office
145K-273K Annually
Senior level
Gaming • Software • Metaverse
The Role
Responsible for researching AI hardware accelerators, optimizing performance for cloud computing environments, defining architecture, and analyzing technology trends.
Summary Generated by Built In
What the Role Entails

1. Architecture Research: Conduct in-depth research into the underlying hardware logic of various AI accelerators; evaluate the power-efficiency ratio and suitability of different heterogeneous architectures in the context of Large Language Model (LLM) inference and training.

2. Operator & Performance Optimization: Design and optimize high-performance operator libraries for large-scale cloud computing environments; resolve long-tail latency issues in hardware scheduling, memory management, and distributed communication.

3. Interconnect Architecture Definition: Define the interconnect architecture; drive the virtualization, standardized access, and efficient pooling of heterogeneous computing resources in the cloud.

4. Technology Trend Analysis: Monitor global trends in semiconductors and accelerators; perform feasibility studies and experimental validation for the implementation of emerging technologies within cloud infrastructure.

Who We Look For

1. Education: Master’s or Ph.D. degree in Computer Engineering, Electronic Engineering, Microelectronics, or a related field.

2. Core Expertise: Expertise in GPGPU architectures or other mainstream AI accelerator architectures.

3. Programming & Frameworks: Proficient in parallel computing frameworks; deep understanding of low-level operator development languages (e.g., CUDA, Triton).

4. Network & Distributed Systems: Solid understanding of large-scale distributed systems, cluster topologies (e.g., Fat-tree, Torus), and high-performance network protocols.

5. Industry Insight: Familiar with the architectural evolution of leading global computing companies; able to objectively analyze the technical pros and cons and engineering challenges of different architectural paths.

6. Experience: Experience in the application, optimization, or architectural design of ultra-large-scale accelerator clusters is preferred.

7. Framework Optimization: Experience in the low-level adaptation and performance tuning of mainstream deep learning frameworks (e.g., PyTorch, TensorFlow) is preferred.

Location State(s)

US-California-Palo Alto

The expected base pay range for this position in the location(s) listed above is $145,100.00 to $273,200.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign-on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. Employees are also eligible for 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Skills Required

  • Master's or Ph.D. in Computer Engineering or related field
  • Expertise in GPGPU or AI accelerator architectures
  • Proficient in parallel computing frameworks
  • Solid understanding of distributed systems and network protocols
  • Experience with ultra-large-scale accelerator clusters preferred
  • Experience in performance tuning of deep learning frameworks preferred

Tencent Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Tencent and has not been reviewed or approved by Tencent.

  • Healthcare Strength: Healthcare coverage is positioned as a standout, with strong PPO options and relatively low prescription costs highlighted for U.S. plans. This suggests the medical offering can be a meaningful component of the overall rewards package for U.S.-based employees.
  • Retirement Support: Retirement support is framed as competitive in the U.S., with employer match details called out as an item to confirm in writing. This indicates retirement benefits can be a notable strength where applicable.
  • Strong & Reliable Incentives: Performance-linked incentives and share-based awards are repeatedly included as part of the compensation model, alongside potential RSU and sign-on eligibility in certain roles. This points to total rewards often extending beyond base pay through variable and equity components.

The Company
HQ: Palo Alto, CA
107,879 Employees
Year Founded: 1998

What We Do

Tencent uses technology to enrich the lives of Internet users. Our communications and social platforms Weixin and QQ connect users with each other, with digital content, and with daily life services in just a few clicks. Our high-performance advertising platform helps brands and marketers reach hundreds of millions of consumers in China. Our financial technology and business services support our partners' business growth and assist their digital upgrade. We invest heavily in talent and technological innovation, actively participating in the development of the Internet industry. Tencent was founded in Shenzhen, China, in 1998, and has been listed on the Main Board of the Stock Exchange of Hong Kong since June 2004.
