Staff Software Engineer, Storage (Java or C++))

Posted 11 Days Ago
Be an Early Applicant
Hiring Remotely in Praha, Hlavní město Praha
In-Office or Remote
Mid level
Big Data • Software • Analytics
The Role
As a Staff Software Engineer at Cloudera, you'll design and implement core features of Apache Ozone, mentor junior engineers, and collaborate on distributed systems for big data analytics.
Summary Generated by Built In

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. 

Cloudera is looking for an exceptional and passionate software engineer with a strong distributed systems background to join the Storage Engineering team focused on building Apache Ozone. The Storage team is responsible for primary storage and storage access layers, which are core to the platform. They created and wrote most of the HDFS code and made a huge impact on the big data and cloud computing industry. Apache Ozone (Apache Ozone) provides a massively scalable distributed object store with a distributed file system interface. Ozone is designed to scale to tens of billions of files and blocks, and overcome the limitations of Hadoop Distributed File System (HDFS), namely, millions of small files and managing a huge number of datanodes. 

Ozone is one of the fastest-growing products inside CDP in terms of customer adoption and expansion revenue.

As a Staff Software Engineer, you will…

  • You will be directly involved in the design and implementation of the core feature set of Apache Ozone and Apache Ratis (open-source RAFT implementation) 

  • You will regularly contribute code and design docs to the Apache open-source community.

  • As part of storage engineering, you will support enterprise customers running 100s of petabytes-scale big data analytics and ML/AI pipelines. 

  • You will partner with product managers and cross-functional teams as a part of the Cloudera Data Platform ecosystem in understanding requirements and turning them into a solid design and implementation, and facilitating integration and adoption.

  • You will be responsible for leading and collaborating with a talented group of engineers working on a feature and mentoring junior engineers.

We are excited if you have…

  • Bachelor's +6, Master's 4-6 years of relevant industry experience required

  • Strong backend engineering skill set with expertise in Java, or strong C++ skills, with intermediate Java expertise

  • Passionate about programming. Clean coding habits, attention to detail, and focus on quality

  • Experience with large-scale, distributed systems design and development with a strong understanding of scaling, replication, consistency, and high availability

  • Solid experience with system software design and development with a strong understanding of computer architecture, storage, network, and IO subsystems, and distributed systems

  • Hands-on programmer with strong data structures and algorithms skillset

  • Strong oral and written communication skills

You may also have..

  • Strong background in a distributed storage system, including file systems, database storage internals, NoSQL storage, or distributed hash tables

  • Strong background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations

  • Strong understanding of the Apache Big Data ecosystem and over 3+ years of experience in systems software, including file systems

  • Recognized contributions to open source projects

  • Experience using projects such as Hive, Pig, MapReduce, HBase, etc., is a big plus

  • Good Understanding of storage development, RAFT replication framework, or equivalent distributed consensus frameworks

Why this role matters: 

You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.

Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-ZC1

#LI-REMOTE

Top Skills

C++
Java
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Palo Alot, CA
3,092 Employees
Year Founded: 2008

What We Do

At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community,

Similar Jobs

GitLab Logo GitLab

Security Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
In-Office or Remote
36 Locations
2500 Employees

Pfizer Logo Pfizer

Health Representative Primary Care - Severní i Západní Čechy

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Remote or Hybrid
Czech Republic
121990 Employees

Mondelēz International Logo Mondelēz International

Director, Engineering Bakery MEU

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
Czech Republic
90000 Employees

Pfizer Logo Pfizer

Health Representative Primary Care - střední Čechy, Praha 5, Praha západ

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
Remote or Hybrid
Czech Republic
121990 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account