Technical Lead, Spark

Posted 10 Days Ago
4 Locations
In-Office or Remote
Senior level
Big Data • Software • Analytics
The Role
Lead design and implementation of Spark and Livy-based features for large-scale distributed data processing. Contribute to Apache projects, develop in Java/Scala/Python, debug production clusters, improve infrastructure, and collaborate across distributed teams to deliver enterprise-grade data engineering capabilities.
Summary Generated by Built In

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. 

Cloudera is seeking a Senior Staff Software Engineer, Spark (Java) with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark and Livy. The role involves building enterprise-grade systems for customers running Spark on thousands of nodes and processing petabytes of data.

We are looking for a passionate engineer eager to enhance a product already supporting major production systems and to drive the next-generation Data Engineering experience. You will collaborate with a distributed team across the United States and Hungary, including multiple Apache Spark committers.

This role is not eligible for immigration sponsorship or relocation.

As a Senior Staff Software Engineer, you will:

  • Design new features for Cloudera’s data engineering experience and take them from prototype to production at scale, leading the team that delivers them

  • Contribute to Apache Spark and Livy

  • Develop new features in Scala/Java/Python on modern platforms

  • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems.

  • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, with a focus on Iceberg and Spark, that you can apply in your daily tasks

  • Work on large-scale distributed systems, from hundreds to thousands of nodes, in production clusters

  • Debug system-level deployment issues, perform root cause analysis and system test analysis, and resolve failures

  • Work on improving internal infrastructure

  • Collaborate with other team members and stakeholders

We are excited if you have (Required Experience):

  • BSc/MSc in a related field or equivalent experience

  • 6+ years of professional software development.

  • Experience leading and delivering complex product enhancements.

  • We use Java, Scala, and Python in our projects. You should have a strong understanding of at least one of these languages and be interested in learning the others.

  • Experience with systems design and development.

  • Passion for programming, clean coding habits, attention to detail, and a focus on quality

  • Strong oral and written communication skills.

  • Strong ability to research and solve problems independently without constant supervision

  • (Most importantly) Open-mindedness and a desire to learn new things and build great products.

  • Experience with distributed systems

You may also have:

  • Experience with SQL planners

  • Experience using or developing Apache Spark, Livy, or other related technologies.

  • Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.

  • Solid experience with at least one cloud provider

Why this role matters: 

You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.

Collaboration is key: you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.

What you can expect from us:

  • Generous PTO Policy 

  • Support for work-life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

Skills Required

  • BSc/MSc in related field or equivalent experience
  • 6+ years professional software development
  • Experience leading and delivering complex product enhancements
  • Strong understanding of at least one language: Java, Scala, or Python
  • Experience with systems design and development
  • Experience with distributed systems
  • Passion for programming, clean coding habits, attention to detail, and focus on quality
  • Strong oral and written communication skills
  • Strong ability to research and solve problems independently
  • Open-mindedness and desire to learn new technologies
  • Experience with SQL planners
  • Experience using or developing Apache Spark, Livy or related technologies
  • Experience with large-scale distributed systems design, scaling, performance, and scheduling
  • Solid experience with at least one cloud provider

Cloudera Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Cloudera and has not been reviewed or approved by Cloudera.

  • Leave & Time Off Breadth: Time off includes generous PTO and holidays plus recurring company‑wide Unplugged Days that provide regular recharge time. Volunteer time off and flexible scheduling options further expand usable leave.
  • Healthcare Strength: Health coverage spans comprehensive medical, dental, and vision alongside EAP, wellness sessions, and U.S. gym reimbursement. These elements position healthcare as a strong anchor within the package.
  • Strong & Reliable Incentives: Compensation often includes variable incentives and long‑term incentive programs with annual bonuses commonly offered. Sales and other revenue roles show competitive on‑target earnings when goals are met, reinforcing the incentive structure.

The Company
HQ: Palo Alto, CA
3,092 Employees
Year Founded: 2008

What We Do

At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
