Staff Software Engineer, Storage

Reposted 15 Days Ago
Be an Early Applicant
Hiring Remotely in Hungary
Remote
Senior level
Big Data • Software • Analytics
The Role
The Staff Software Engineer will design and implement features for Apache Ozone, contribute to the open-source community, mentor junior engineers, and support enterprise customers with scalable data solutions.
Summary Generated by Built In

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. 

Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone. The Storage team is responsible for primary storage and storage access layers, which are core to the platform. They created and wrote most of the HDFS code and made a huge impact on the big data and cloud computing industry. Apache Ozone (Apache Ozone) provides a massively scalable distributed object store with a distributed file system interface. Ozone is designed to scale to tens of billions of files and blocks, and overcome the limitations of Hadoop Distributed File System (HDFS), namely, millions of small files and managing a huge number of datanodes. 

Ozone is one of the fastest-growing products inside CDP in terms of customer adoption and expansion revenue.

As a Staff Software Engineer, you will…

  • You will be directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation) 

  • You will regularly contribute code and design docs to the Apache open-source community.

  • As part of storage engineering, you will support enterprise customers running 100s of petabytes-scale big data analytics and ML/AI pipelines. 

  • You will partner with product managers and cross-functional teams as a part of the Cloudera Data Platform ecosystem in understanding requirements and turning them into a solid design and implementation, and facilitating integration and adoption.

  • You will be responsible for leading and collaborating with a talented group of engineers working on a feature and mentoring junior engineers.

We are excited if you have…

  • Bachelor's +6, Master's 4-6 years of relevant industry experience required

  • Strong backend engineering skill set with expertise in Java, or strong C++ skills, with intermediate Java expertise

  • Passionate about programming. Clean coding habits, attention to detail, and focus on quality

  • Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability

  • Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems

  • Hands-on programmer with strong data structures and algorithms skillset

  • Strong oral and written communication skills

You may also have..

  • Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables

  • Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations

  • Strong understanding of the Apache Big Data ecosystem and over 3+ years of experience in systems software, including file systems

  • Recognized contributions to open source projects

  • Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus

  • Good Understanding of storage development, RAFT replication framework, or equivalent distributed consensus frameworks

Why this role matters: 

You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.

 Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-REMOTE

#LI-ZC1 

Top Skills

Apache Ozone
Apache Ratis
C++
Hadoop
Hbase
Hive
Java
Mapreduce
Pig
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alot, CA
3,092 Employees
Year Founded: 2008

What We Do

At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community,

Similar Jobs

Deepgram Logo Deepgram

Solutions Engineer

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Remote
29 Locations
150 Employees
160K-200K Annually

Deepgram Logo Deepgram

Customer Success Engineer (EMEA)

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
In-Office or Remote
30 Locations
150 Employees
135K-180K Annually

Deepgram Logo Deepgram

Solutions Architect

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Remote
29 Locations
150 Employees
140K-200K Annually

Deepgram Logo Deepgram

Solutions Architect

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
Remote
29 Locations
150 Employees
140K-200K Annually

Similar Companies Hiring

Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees
PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account