Business Area:
EngineeringSeniority Level:
Mid-Senior levelJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance.
The Replication Manager team is seeking passionate developers to enhance replication support for the Cloudera Data Platform. The team’s mission is to provide a seamless experience for customers moving data and associated entities to support migration, replication, and disaster recovery.
Replication Manager enables customers to replicate data across data centers or between on-premises and cloud environments. This includes data in HDFS, Ozone, or cloud buckets; Hive, HBase, or Iceberg tables; Ranger permissions; and Atlas lineage. Datasets range from terabytes to petabytes, with challenges such as millions of directories, large file sizes, and near real-time HBase WAL replication.
As a Staff Software Engineer, you will
Build and maintain large-scale replication systems on top of the Cloudera Data Platform stack
Be responsible for our products running in production
Work with a distributed team of engineers to design cloud-based, low RPO, RTO replication architectures
Support replication across multiple Cloudera components like HDFS, Ozone, Hive, HBase, Iceberg, Atlas, and Ranger
Give and take actionable feedback
Mentor junior engineers
Work with product management and occasionally, with field engineers on the product roadmap and early access feature introductions
We’re excited about you if you have:
Masters in Computer Science or related field and 4-6 years of experience - or Bachelors and more than 6 years of relevant industry experience
Strong backend engineering skill set with expertise in Java or Scala or Kotlin
Ability to read large codebases and write succinct, clean code
Experience with system software design and development with an understanding of computer architecture, storage, network, and IO subsystems
Systems/DevOps experience
You may also have
Experience with large-scale, distributed systems design and development with an understanding of scaling, replication, consistency, and high availability
Current expertise with Java/Scala/Kotlin developer ecosystems
Experience with AWS, Azure or GCP
Test automation experience along with Python basics
Background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations
Why this role matters:
You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.
Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups
EEO/VEVRAA
#LI-ZC1
#LI-HYBRID
Top Skills
What We Do
At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community,








