Business Area:
EngineeringSeniority Level:
Mid-Senior levelJob Description:
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.
At Cloudera, our Data Services Pillar is the heart of data innovation. We don’t just work with technology; we build it. Our mission is to empower data practitioners by creating seamless, enterprise-grade experiences for data engineering, warehousing, streaming, operational databases, and AI.
This is your opportunity to build cloud-native solutions that are deployable anywhere—whether in massive clusters on any cloud provider or in private data centers. You’ll work with cutting-edge technologies like Trino, Spark, Airflow, and advanced AI inferencing systems to shape the future of analytics. Your code will directly influence how data engineers, analysts, and developers worldwide find value in their data.
We believe in the power of open source. You’ll collaborate with project committers, contributing upstream to keep technologies like Apache Hive and Impala evolving. You’ll harden these engines for rock-solid security, optimize them for peak performance, and make them effortlessly run across all environments.
Join us and help build the trusted, cloud-native platform that powers insights for the most data-intensive companies on the planet.
Cloudera is seeking an exceptional and passionate Senior Software Engineer in Test with a background in distributed systems to join the Storage Engineering team, which focuses on building Apache Ozone. The Storage team is responsible for the primary storage and storage access layers which are core to the Cloudera Data Platform. Apache Ozone (https://ozone.apache.org) is an open-source project intended to build a massively scalable distributed object store. Ozone is designed to scale to thousands of nodes, tens of billions of objects and overcome the limitations of the Hadoop Distributed File System (HDFS).
As a Senior Software Engineer in Test, you will…
Review, simplify, and rationalize already existing test cases and our internal testing framework code.
Prepare and implement test plans for newly developed features, and be part of the design process to ensure that testability is a concern from the beginning of the feature development.
Review and work on the different levels of testing within open source projects.
Work with our internal teams to integrate different layers of tests into our internal workflows related to development and supporting our customers.
Will be responsible for continuously increasing the quality of the storage layer within Cloudera's Data Platform
Develop an understanding of popular open source projects of Apache Hadoop; hyperscale cloud platforms like AWS, Azure, and Container technologies like Kubernetes and Docker.
We are excited if you have…
Strong programming skills in Python and any of the following languages: Java/JavaScript
Ability to design, build, and maintain automated testing frameworks, tools, and automated test suites, in Python (pytest), preferred or Java (TestNG/JUnit).
Sound knowledge of test methodologies, including creation of test cases and test plans.
Good Debugging skills, esp. involving distributed systems, preferably on Linux
Ability to work closely with the Engineering teams and come up with test scenarios for new features, involving Big Data technologies.
Working knowledge in storage systems and experience in developing and executing comprehensive storage testing strategies, evaluating functional, performance, scalability, stress, integrity, and security aspects of storage systems will be considered a strong asset.
Ability to design and maintain CI/CD pipelines for enabling fast-paced, low-touch releases of our product
Ability to work effectively both independently and as part of a team.
Knowledge of Public Clouds (AWS/Azure) and/or Container Technologies (Docker, Kubernetes) is a plus.
You may also have...
BS/MS in Computer Science or related field
4+ years experience in test development, automation framework and tools development.
Strong knowledge in back-end testing on any of the following: Web Services, Databases, enterprise storage products, or large-scale distributed systems.
Strong knowledge in popular test automation frameworks and test automation methodologies.
Familiarity with DevOps technologies such as Docker, Kubernetes, Ansible, Jenkins, Github, Maven, etc
Excellent communication and collaboration skills.
Comfortable working in fast-paced environments
Working knowledge of Apache Hive, Impala, Hue, and the Big Data ecosystem is an added advantage.
Why this role matters:
You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that power CDP and keep it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modelling.
Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardise best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.
What you can expect from us:
Generous PTO Policy
Support work life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups
EEO/VEVRAA
#LI-ZC1
#LI-HYBRID
Top Skills
What We Do
At Cloudera, we believe that data can make what is impossible today, possible tomorrow. We empower people to transform complex data into clear and actionable insights. Cloudera delivers an enterprise data cloud for any data, anywhere, from the Edge to AI. Powered by the relentless innovation of the open source community,







