Job Title: Cloudera Data Engineer (Offshore – Pakistan)
Location: Pakistan (Offshore)
Experience Required: 5+ Years
Employment Type: Full-Time
We are seeking a highly skilled Cloudera Data Engineer to join our offshore data engineering team in Pakistan. The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines within the Cloudera ecosystem. The role involves working closely with data architects, administrators, DevOps, and business stakeholders to ensure reliable and high-performance data processing solutions that support analytics and reporting requirements.
Key Responsibilities
Design, develop, and maintain robust ETL/ELT pipelines using Cloudera ecosystem tools including Apache Spark, Hive, Impala, NiFi, and Oozie.
Build scalable and optimized data pipelines supporting batch and streaming workloads.
Develop reusable frameworks and automation scripts for data ingestion, transformation, cleansing, and loading.
Optimize pipeline performance for reliability, efficiency, and cost-effectiveness.
Collaborate with Cloudera Administrators to tune system performance and efficiently manage cluster resources.
Monitor cluster utilization, job performance, and throughput using Cloudera Manager and related monitoring tools.
Troubleshoot pipeline failures, resolve performance bottlenecks, and address data quality issues.
Work with DevOps and Infrastructure teams to operationalize data platforms, including:
CI/CD pipeline development
Deployment automation
Environment promotion workflows
Monitoring and observability integration
Participate in incident management, root cause analysis, and preventive maintenance activities.
Collaborate with data architects, administrators, and engineering teams to improve end-to-end data platform operations.
Ensure data pipelines consistently meet defined SLA, latency, and quality standards.
Qualifications
Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
Minimum of 5 years of experience in data engineering with strong exposure to the Cloudera platform.
Hands-on expertise with:
Apache Spark
Hive
Impala
Apache NiFi
Apache Oozie
Hadoop ecosystem components
Strong experience in SQL and data modeling concepts.
Experience working with large-scale distributed data processing systems.
Strong troubleshooting, debugging, and performance tuning skills.
Understanding of CI/CD, DevOps practices, and automation tools.
Experience working in offshore or distributed delivery models is preferred.
Strong communication and collaboration skills.
Experience working with streaming technologies (Kafka, Spark Streaming, or similar).
Exposure to cloud data platforms or hybrid data architectures.
Familiarity with data governance, security, and compliance practices.
Knowledge of scripting languages such as Python or Shell scripting.
What We Do
Datamatics Technologies (DMT) was established in Dubai. We specialize in providing onsite and offshore professional services, covering the full spectrum of Data Analytics and Data Science domains.
Our experience working with diverse industry sectors, such as Telecoms, Finance, Government, and Manufacturing, across multiple regions enables us to engage and deliver for our clients with confidence.
We offer our full portfolio of services through resource augmentation and managed services, on either time-and-materials (T&M) or fixed-price arrangements. Through our end-to-end managed services offering, we enable our clients to cut costs, increase profitability, and focus on adding value to their core business activities.
Our project and delivery management team is certified in Agile, PMI, and ITIL, so that planning and execution follow industry best practices.
We work with clients across the Middle East and Africa region.