Data Engineer

Reposted 4 Days Ago
2 Locations
In-Office
Mid level
Artificial Intelligence • Information Technology • Software
The Role
The Data Engineer designs and optimizes scalable data solutions, manages data pipelines, and implements MLOps principles to support AI systems and enhance data quality and governance.
Summary Generated by Built In

DeepLight AI is a specialist AI and data consultancy with extensive experience implementing intelligent enterprise systems across multiple industries, with particular depth in financial services and banking. Our team combines deep expertise in data science, statistical modeling, AI/ML technologies, workflow automation, and systems integration with a practical understanding of complex business operations. 

The Data Engineer is responsible for designing, implementing, and optimising data pipelines and infrastructure to support our cutting-edge AI systems. The Data Engineer collaborates closely with our multidisciplinary team to ensure the efficient collection, storage, processing, and analysis of large-scale data, enabling us to unlock valuable insights and drive innovation across various domains. 

Responsibilities of the role:

  • Design, build, and optimise scalable data solutions, primarily utilising the Lakehouse architecture to unify data warehousing and data lake capabilities. Advise stakeholders on the strategic choice between Data Warehouse, Data Lake, and Lakehouse architectures based on specific business needs, cost, and latency requirements. 
  • Design, develop, and maintain scalable and reliable data pipelines to ingest, transform, and load diverse datasets from various sources, including structured and unstructured data, streaming data, and real-time feeds. 
  • Implement standards and tooling to ensure ACID properties, schema evolution, and high data quality within the Lakehouse environment. Implement robust data governance frameworks (security, privacy, integrity, compliance, auditing). 
  • Continuously optimize data storage, compute resources, and query performance across the data platform to reduce costs and improve latency for both BI and ML workloads, leveraging techniques such as indexing, partitioning, and parallel processing. 
  • Develop and maintain CI/CD pipelines to automate the entire machine learning lifecycle, from data validation and model training to deployment and infrastructure provisioning. 
  • Deploy, manage, and scale machine learning models into production environments, utilizing MLOps principles for reliable and repeatable operations. 
  • Establish and manage monitoring systems to track model performance metrics and to detect data drift (changes in input data) and model decay (degradation in prediction accuracy). 
  • Ensure rigorous version control and tracking for all components: code, datasets, and trained model artifacts (using tools like MLflow or similar). 
  • Create comprehensive documentation, including technical specifications, data flow diagrams, and operational procedures, to facilitate understanding, collaboration, and knowledge sharing.

Requirements
  • Proven practical experience in designing, building, and optimising solutions using Data Lakehouse architectures (e.g., Databricks, Delta Lake).
  • Strong hands-on experience with managing data ingestion, schema enforcement, ACID properties, and utilizing big data technologies/frameworks like Spark and Kafka. 
  • Expertise in data modeling, ETL/ELT processes, and data warehousing concepts. Proficiency in SQL and scripting languages (e.g., Python, Scala). 
  • Demonstrated practical experience implementing MLOps pipelines for production systems. This includes a solid understanding and implementation experience with MLOps principles: automation, governance, and monitoring of ML models throughout the entire lifecycle. 
  • Experience with CI/CD tools, containerization/orchestration technologies (e.g., Docker, Kubernetes), model serving frameworks (e.g., TensorFlow Serving, SageMaker), and experiment tracking (e.g., MLflow). 
  • Experience with production monitoring tools to detect data drift or model decay. 
  • Strong hands-on experience with major cloud platforms (e.g., AWS, Azure, GCP) and familiarity with DevOps practices. 
  • Excellent analytical, problem-solving, and communication skills, with the ability to translate complex technical concepts into clear and actionable insights. 
  • Proven ability to work effectively in a fast-paced, collaborative environment, with a passion for innovation and continuous learning.

Benefits

  • Competitive salary and performance bonuses
  • Comprehensive health insurance
  • Professional development and certification support
  • Opportunity to work on cutting-edge AI projects
  • Flexible working arrangements
  • Career advancement opportunities in a rapidly growing AI company

This position offers a unique opportunity to shape the future of AI implementation while working with a talented team of professionals at the forefront of technological innovation. The successful candidate will play a crucial role in driving our company's success in delivering transformative AI solutions to our clients.

Top Skills

AWS
Azure
CI/CD
Data Lakehouse
Databricks
Delta Lake
Docker
GCP
Kafka
Kubernetes
MLflow
Python
SageMaker
Scala
Spark
SQL
TensorFlow Serving

The Company
Dubai
19 Employees

What We Do

Welcome to Deeplight, the pioneering consultancy dedicated to guiding businesses through every facet of the AI adoption journey. From initial industry awareness to seamless integration and optimization, Deeplight is your strategic partner for unlocking the full potential of Artificial Intelligence.

The Enterprise Innovation Roadmap: Navigating Your AI Journey

EXECUTIVE COACHING: Propel your leadership team to new heights with personalized coaching driven by AI insights and guidance. Deeplight ensures your leadership makes informed decisions and drives innovation at the highest level.

CURRENT STATE ASSESSMENT: Embark on your AI journey with clarity by conducting a data-driven assessment. Gain a comprehensive understanding of your organization's AI readiness, identifying key strengths, weaknesses, and opportunities to maximize your return on AI investment.

DATA READINESS INITIATIVES: Elevate your data game with Deeplight's tailored data cleansing and organization. Bridge the gap between your current data and the needs of successful AI implementation, ensuring your AI models have the fuel they need to thrive.

USE CASE WORKSHOPS: Collaboratively explore the most impactful AI applications for your unique business needs. Deeplight facilitates brainstorming sessions, feasibility studies, and prioritization to identify use cases with the highest potential for success. Turn ideas into actionable strategies with our expert guidance.

CUSTOM MODEL DESIGN: Deeplight goes beyond basic AI models with custom "AiGent" design. Craft intelligent agents tailored to your challenges and goals, ensuring optimal performance and alignment with your strategic vision. Redefine what's possible with AI for your business.

Ready to elevate your business through the power of AI?

Deeplight serves as your beacon of innovation, guiding businesses through the complex process of AI adoption. Connect with us to embark on a transformative AI journey that propels your enterprise to new heights.
