Distinguished Engineer, Apache Spark

2 Locations
In-Office
308K-472K
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Role
The Distinguished Engineer will lead the architecture and implementation of Apache Spark acceleration, engage with open source communities, and mentor engineering teams.
Distinguished Engineer - Accelerated Apache Spark

NVIDIA is seeking a Distinguished Engineer for the Apache Spark Acceleration group.

Over the past five years, GPU-accelerated data processing has moved from proof of concept to production deployments. Many enterprises now recognize the need for accelerated computing in large-scale data processing. Multi-node GPU deployments will reduce cloud computing costs and lower the latency of large-scale data processing.

At NVIDIA, we have invested in accelerating Apache Spark by providing an open source plugin for it. Apache Spark is the most popular data processing engine in data centers. We strive to accelerate Spark applications on GPUs without any code changes. Our OSS RAPIDS Spark library is integrated with on-premises and cloud services such as AWS EMR, Databricks, Google Dataproc, Oracle Cloud Data Flow, Bytedance Volcengine, Tencent Cloud and Cloudera.

You will serve as a hands-on architect of the NVIDIA Spark Acceleration Group. You will work with a team of distributed systems engineers, including PMC members and committers of Apache Spark, Apache Hadoop, Apache Hive, and Apache Arrow. You will engage in open source projects such as Apache Spark, RAPIDS, Apache Iceberg, Delta Lake, UCX and more.

What you'll be doing:
  • Lead the architecture, design and implementation of accelerated Apache Spark and related big-data frameworks
  • Engage open source communities (including Apache Spark, RAPIDS, Apache Iceberg, Delta Lake and UCX) for technical discussion and contribution, and engage new communities where we may not have a strong presence yet
  • Work with NVIDIA partners to deploy GPU-enabled data analytics solutions in public cloud or on-premises clusters
  • Present technical solutions at industry conferences and meetups
  • Collaborate with distributed systems teams to define solutions to distributed processing challenges at large scale
  • Provide recommendations and feedback to teams regarding decisions surrounding topics such as infrastructure, continuous integration and testing strategy
  • Build, test and optimize CUDA/C++ libraries across different platforms
  • Build automation and tools that will increase the productivity of teams developing distributed systems
  • Mentor members of the engineering team
What we need to see:
  • BS, MS, or PhD in Computer Science, Computer Engineering, or closely related field (or equivalent experience)
  • 17+ years of work or research experience in software development
  • Prior experience in delivering complex software projects as a lead architect
  • Outstanding technical skills in designing and implementing high-quality distributed systems
  • Excellent programming skills in C++, Java, and/or Scala
  • Highly motivated with strong interpersonal and communication skills
  • 5+ years of working experience with key open source big-data projects as a contributor or committer, such as Apache Spark, Apache Hadoop, Apache Flink, Apache Kafka, Apache Hive, Apache Arrow, or Delta Lake
  • Excellent knowledge of distributed system schedulers: Kubernetes, Hadoop YARN, Apache Spark
  • Able to delve into a new area and quickly come up to speed
  • Able to work with teams across boundaries and geographies

Ways to stand out from the crowd:
  • Working experience in designing and developing columnar query engines would be a huge plus
  • Committership at major open source projects (such as Apache Spark, Apache Hadoop, Apache Flink) is a big plus
  • Working experience with acceleration libraries (CUDA, RAPIDS, UCX) is helpful

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 308,000 USD - 471,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 5, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

Top Skills

Apache Arrow
Apache Flink
Apache Hive
Apache Kafka
Spark
C++
CUDA
Delta Lake
Hadoop
Java
Kubernetes
RAPIDS
Scala
UCX
YARN

The Company
HQ: Santa Clara, CA
21,960 Employees
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”
