Staff AI Infrastructure Engineer

Posted 17 Days Ago
Santa Clara, CA
Senior level
Automotive
The Role
The role focuses on enhancing AI/ML infrastructure to improve productivity for researchers. Responsibilities include resolving infrastructure gaps, developing scalable solutions, optimizing performance, and collaborating with various teams to create an integrated AI/ML infrastructure ecosystem.
Summary Generated by Built In

XPeng Motors is one of China’s leading smart electric vehicle (EV) companies. We design, develop, and manufacture smart EVs that are seamlessly integrated with advanced Internet, AI and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. We strive to transform smart electric vehicles with technology and data, shaping the mobility experience of the future.

We are looking for a talented AI/ML Infrastructure Engineer to join our team. In this role, you will improve our researchers' productivity by enhancing the entire AI/ML infrastructure stack. Your primary duty will be to identify and resolve infrastructure gaps to provide reliable, efficient, and scalable solutions.

Job Responsibilities:

  • Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions

  • Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams

  • Design and implement solutions for critical areas, including distributed storage systems, scheduling systems, high availability capabilities, and core reliability issues within our large-scale GPU clusters

  • Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization

  • Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks

  • Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem

Minimum Skill Requirements:

  • Bachelor's degree in Computer Science, Engineering, or related technical field

  • 5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain

  • Proficiency in programming languages such as Python, Go, or C++, with knowledge of cloud computing platforms such as AWS or Azure

  • Strong communication and collaboration abilities, effective in working with diverse teams and individuals

Preferred Skill Requirements:

  • In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines

  • Practical experience with containerization technologies (e.g., Docker, Kubernetes), automation tools (e.g., Ansible, Terraform), and monitoring solutions (e.g., Prometheus, Grafana)

  • Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions

  • A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space

What we provide:

  • A fun, supportive and engaging environment

  • Opportunity to make a significant impact on the transportation revolution by advancing autonomous driving

  • Opportunity to work on cutting-edge technologies with the top talent in the field

  • Competitive compensation package

  • Snacks, lunches and fun activities

The base salary range for this full-time position is $180,000-$300,000, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.

Top Skills

C++
Go
Python
The Company
Palo Alto, CA
993 Employees
On-site Workplace
Year Founded: 2014

What We Do

XPeng Motors is a leading Chinese electric vehicle and technology company that designs and manufactures intelligent automobiles that are seamlessly integrated with the Internet and use the latest advances in artificial intelligence. Focusing on China’s young and tech-savvy consumer base, XPeng Motors strives to offer smart mobility solutions through technology innovation and cutting-edge R&D. The company’s initial backers include its CEO & Chairman He Xiaopeng, the founder of UCWeb Inc. and a former Alibaba executive. It was co-founded in 2014 by Henry Xia and He Tao, former senior executives at Guangzhou Auto with expertise in innovative automotive technology and R&D. It has received funding from prominent Chinese and international investors including Alibaba Group, Foxconn Group and IDG Capital. Currently with 3,000 employees, the company is headquartered in Guangzhou and has design, R&D, manufacturing, and sales & marketing divisions in Silicon Valley, San Diego, Beijing, Shanghai, Zhaoqing (Guangdong Province) and Zhengzhou (Henan Province).

