Senior Staff AI Data Infrastructure Engineer

Posted 5 Days Ago
Be an Early Applicant
Santa Clara, CA, USA
In-Office
203K-344K Annually
Senior level
Automotive
The Role
As a Senior Staff AI Data Infrastructure Engineer, you will design scalable data pipelines, optimize training throughput, and support infrastructure evolution for AI data management.
Summary Generated by Built In
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity.
 
As a core member of our AI Infrastructure team, you will work at the intersection of Autonomous Driving and Foundation Models. We don't just process EB-scale perception data from tens of thousands of production vehicles; we are building the high-performance Data Engine that powers our next-generation AI. Your work will directly determine how our self-driving systems "learn" from massive datasets and define the cognitive ceiling of multi-modal models in the physical world.
 
Key Responsibilities
  • Scalable Data Pipelines: Architect and build scalable, end-to-end pipelines to automate the ingestion, cleaning, and processing of PB-scale raw data for both production autonomy and multi-modal LLMs.
  • Modern Lakehouse Architecture: Evolve our data storage solutions based on Apache Iceberg and Lance to implement efficient semantic indexing, metadata management, and data versioning.
  • Training Throughput Optimization: Deeply optimize data loading and pre-fetching strategies to ensure maximum throughput for large-scale training on 10,000+ GPU clusters.
  • Infrastructure Evolution: Support the seamless transition of foundation model data into actionable training sets, bridging the gap between raw vehicle logs and model-ready tokens.
 
Basic Qualifications
  • Engineering Excellence: BS/MS/PhD in Computer Science or a related field, with a proven track record of building large-scale distributed systems.
  • Work Experience: 5-8 + years of industry experience.
  • Programming Mastery: Proficient in Python, C++, or Java, with a deep understanding of high-performance concurrent programming and systems design.
  • Distributed Frameworks: Hands-on experience with at least one distributed processing framework, such as Ray and Spark.
  • Lakehouse Expertise: Familiarity with Data Lakehouse concepts and practical experience with technologies like Iceberg and Lance.
 
Preferred Qualifications
  • Experience building data warehouses for Trillion-token datasets or PB-scale multi-modal data.
  • Deep understanding of data access patterns in deep learning frameworks like PyTorch, DeepSpeed, or Megatron.
  • Practical experience with Vector Databases, automated labeling toolchains, or data-centric AI workflows.
  • Knowledge of storage formats optimized for AI (e.g., Parquet, Lance) and high-performance file systems.

What do we provide:
  • A fun, supportive and engaging environment.
  • Infrastructures and computational resources to support your work.
  • Opportunity to work on cutting edge technologies with the top talents in the field.
  • Opportunity to make significant impact on the transportation revolution by the means of advancing autonomous driving.
  • Competitive compensation package.
  • Snacks, lunches, dinners, and fun activities.
 
The base salary range for this full-time position is $203,450-$344,300, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
 
We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Palo Alto, CA
993 Employees
Year Founded: 2014

What We Do

Xpeng Motors is a leading Chinese electric vehicle and technology company that designs and manufactures intelligent automobiles that are seamlessly integrated with the Internet and utilize the latest advances in artificial intelligence. Focusing on China’s young and tech-savvy consumer base, XPENG Motors strives to offer smart mobility solutions with technology innovation and cutting-edge R&D. The company’s initial backers include its CEO & Chairman He Xiaopeng, the founder of UCWeb Inc. and a former Alibaba executive. It was co-founded in 2014 by Henry Xia and He Tao, former senior executives at Guangzhou Auto with expertise in innovative automotive technology and R&D. It has received funding from prominent Chinese and international investors including Alibaba Group, Foxconn Group and IDG Capital. Currently with 3,000 employees, the company is headquartered in Guangzhou and has design, R&D, manufacturing and sales & marketing divisions in Silicon Valley, San Diego, Beijing, Shanghai, Zhaoqing (Guangdong Province) and Zhengzhou (Henan Province).

Similar Jobs

In-Office
Santa Clara, CA, USA
993 Employees
203K-344K Annually

RoboForce Logo RoboForce

Senior / Staff AI Research Engineer, Data Infrastructure

Artificial Intelligence • Machine Learning • Robotics
In-Office
Milpitas, CA, USA
14 Employees

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Supervisor I

eCommerce • Fashion • Other • Retail • Sales • Wearables • Design
Hybrid
Torrance, CA, USA
16000 Employees
17-28 Hourly

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Sales Support Associate II

eCommerce • Fashion • Other • Retail • Sales • Wearables • Design
Hybrid
Beverly Glen, CA, USA
16000 Employees
15-24 Hourly

Similar Companies Hiring

Cox Enterprises Thumbnail
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
Atlanta, GA
50000 Employees
UL Solutions Thumbnail
Automotive • Professional Services • Software • Consulting • Energy • Chemical • Renewable Energy
Chicago, IL
15000 Employees
HERE Technologies Thumbnail
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account