Role & Responsibilities:
- Data Pipeline Development & Optimization:
  - Design, build, and maintain scalable and reliable data pipelines to support analytics, ML models, and business reporting.
  - Collaborate with data scientists and analysts to ensure data is available, clean, and optimized for downstream use.
  - Implement data quality checks, monitoring, and validation processes.
- Data Architecture & Integration:
  - Work with cross-functional teams to design efficient ETL/ELT workflows using modern data tools.
  - Integrate data from multiple sources (databases, APIs, third-party tools) into centralized storage solutions (data lakes/warehouses).
  - Support cloud-based infrastructure for data storage and retrieval.
- Performance & Scalability:
  - Monitor, troubleshoot, and optimize existing data pipelines to handle large-scale, real-time data flows.
  - Implement best practices for query optimization and cost-efficient data storage.
  - Ensure data is available and accessible for business-critical operations.
- Collaboration & Documentation:
  - Partner with product, engineering, and business stakeholders to understand data requirements.
  - Document data workflows, schemas, and best practices.
  - Support a culture of data reliability, governance, and security.
Requirements:
- Proficiency in Python and SQL for data engineering tasks.
- Strong understanding of ETL/ELT processes, data warehousing, and data modeling.
- Hands-on experience with cloud platforms (AWS, GCP, or Azure) and data storage solutions (BigQuery, Redshift, Snowflake, etc.).
- Familiarity with data orchestration and integration tools (Airflow, Airbyte) is a must.
- Experience with containerization & deployment tools (Docker, Kubernetes) is a plus.
- Knowledge of data governance, security, and best practices for handling sensitive data.
- Familiarity with Git and GitHub.
- Experience with Dataform is a must.
- Strong skills in eliciting requirements from cross-functional stakeholders and translating them into actionable data engineering tasks.
Experience:
- 2+ years in data engineering, building and maintaining data pipelines.
- 2+ years in SQL and Python development for production environments.
- Experience working in fast-growing startup environments is a plus.
- Exposure to real-time data processing frameworks (Kafka, Spark, Flink) is a plus.
What We Do
At Soum, we're on a mission to build a marketplace that enables users to buy and sell anything online with convenience, trust, and ease, starting in the MENA market. We began with second-hand electronics in KSA, becoming market leaders, and are now expanding to used cars and collectibles. In just two years, we've facilitated hundreds of thousands of transactions across 150+ locations in the Kingdom, maintained 10,000 active listings at any given time, earned the distinction of being the 10th most downloaded e-commerce app in KSA for 2023, and received more than 5 million app downloads.
Our team members, Soumers, have worked in over 14 countries, from the US to South Korea. We are always on the lookout for the best and brightest to join. We look beyond resumes and titles: we look for the innovators, the hungry, the go-getters. We offer them the autonomy to experiment, learn, fail, and grow.
Every transaction on Soum is guaranteed; we hold funds until the item is delivered and confirmed by the buyer, thanks to our proprietary fulfillment and payment systems. This ensures a secure and reliable marketplace experience.
Soum has pioneered a solution that has transformed digital commerce for sellers in the wider region:
Individuals: Enjoy unique home pick-up and delivery services, secure payments, and transaction guarantees for worry-free selling.
Small Businesses: Sell faster and reach a wider audience with our dedicated "Merchant App."
Telecom and retail partners: Gain optimal returns with bulk-selling options, comprehensive inspections, and efficient warehouse management.
We deploy AI models to power our marketplace with personalized recommendations, dynamic pricing, predictive analytics, and fraud detection, ensuring a smarter, safer, and more efficient platform for our users.
Soum is backed by local and regional investors including, among others, Jahez Group, Isometry Capital, and Al Rajhi Partners, and has raised $22Mn to date.






