We are looking for a hands-on Data Engineer (4-8 years) to build and optimize scalable data pipelines and analytical datasets on the Databricks platform. You will work closely with Analytics/BI, Product, and Business teams to enable data-driven decision-making. Retail / eCommerce domain exposure is a strong plus, along with the ability to translate business needs into reliable and performant data solutions.
Key Responsibilities
- Design, develop, and maintain robust ETL/ELT pipelines using Databricks (Spark) and Python (PySpark).
- Develop and optimize complex transformations using SQL (joins, window functions, CTEs, query tuning).
- Build curated datasets and data models to support reporting, dashboards, and advanced analytics use cases.
- Implement pipeline reliability best practices: data quality checks, monitoring, alerting, and reconciliation.
- Optimize Databricks workloads for performance and cost (cluster sizing, partitioning strategies, caching, file formats).
- Work with structured and semi-structured data (JSON, CSV, Parquet/Delta) and handle schema evolution.
- Collaborate with stakeholders to understand business KPIs and deliver data solutions aligned to retail/eCommerce metrics (sales, orders, returns, inventory, customer cohorts).
- Follow engineering best practices for version control (Git), documentation, reusable code patterns, and testing.
- Good to have: Support or migrate Alteryx workflows into Python/Databricks pipelines.
- 4-8 years of experience in Data Engineering / Data Warehousing / Big Data.
- Strong hands-on experience with Databricks (Jobs/Workflows, notebooks, cluster concepts, Spark tuning fundamentals).
- Strong programming skills in Python (PySpark preferred).
- Excellent SQL skills, including performance tuning and writing complex analytical queries.
- Experience building scalable pipelines and working with large datasets in distributed environments.
- Strong understanding of data engineering concepts: ETL/ELT, orchestration, data validation, and observability.
- Familiarity with modern data storage formats and practices (Delta/Parquet, partitioning, incremental loads).
- Retail / eCommerce domain knowledge (customer behavior, funnel metrics, pricing/promotions, inventory, catalog, order lifecycle).
- Experience with Alteryx (workflow development, optimization, scheduling, or migration to Databricks).
- Experience with Lakehouse patterns and Delta Lake features (e.g., MERGE, OPTIMIZE, Z-ordering).
- Experience with orchestration tools (e.g., Airflow, ADF, Databricks Workflows).
- Cloud experience: AWS / Azure / GCP (S3/ADLS/GCS, IAM basics, security controls).
- CI/CD exposure for data pipelines, code reviews, and automated deployments.
- Strong problem-solving skills and a mindset for root-cause analysis.
- Ownership and accountability for production-grade pipelines.
- Ability to communicate with both technical and non-technical stakeholders.
- Comfort working in fast-paced environments with evolving requirements.
eClerx is a global leader in productized services, bringing together people, technology and domain expertise to amplify business results. Our mission is to set the benchmark for client service and success in our industry. Our vision is to be the innovation partner of choice for technology, data analytics and process management services. Since our inception in 2000, we've partnered with top companies across various industries, including financial services, telecommunications, retail, and high-tech. Our innovative solutions and domain expertise help businesses optimize operations, improve efficiency, and drive growth. With over 18,000 employees worldwide, eClerx is dedicated to delivering excellence through smart automation and data-driven insights. At eClerx, we believe in nurturing talent and providing hands-on experience.
eClerx is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability or protected veteran status, or any other legally protected basis, per applicable law.
What We Do
eClerx provides business process management, automation, and analytics services to a number of Fortune 2000 enterprises, including some of the world's leading financial services, communications, retail, fashion, media & entertainment, manufacturing, travel & leisure, and technology companies. Incorporated in 2000, eClerx is listed on both the Bombay Stock Exchange and the National Stock Exchange of India. The firm employs 16,000+ people across Australia, Canada, Germany, India, Italy, Netherlands, Philippines, Singapore, Thailand, UK, and the USA.








