Senior Data Engineer

Posted 3 Days Ago
Be an Early Applicant
Lahore, Punjab, PAK
In-Office
Senior level
Software • Database • Analytics
The Role
Design, build, and optimize scalable ETL/ELT data pipelines and enterprise data warehouse solutions using PySpark, AWS Glue, EMR, and Amazon Redshift. Perform performance tuning, ensure data quality and governance, troubleshoot production incidents, and collaborate with stakeholders and distributed teams to support analytics and reporting.
Summary Generated by Built In

Position: Senior Data Engineer
Experience: 5–8 Years
Location: Hybrid (Pakistan)

Note: (Candidates must be available during UAE business hours and follow UAE public holidays)

Job Summary

We are seeking a highly skilled Senior Data Engineer with 5–8 years of experience in designing, developing, and optimizing large-scale data platforms and ETL/ELT pipelines. The ideal candidate will have strong hands-on expertise in PySpark, AWS Glue, Amazon EMR, Amazon Redshift, and SQL-based data warehousing, along with proven experience in performance tuning and data optimization.

The candidate will work closely with a UAE-based customer and must be comfortable collaborating with distributed teams while adhering to UAE working hours and holiday schedules.

Key Responsibilities
  • Design, develop, and maintain scalable data pipelines using PySpark, AWS Glue, and Amazon EMR.
  • Build and optimize data ingestion, transformation, and processing frameworks for structured and semi-structured data.
  • Develop and maintain enterprise data warehouse solutions using Amazon Redshift.
  • Write complex SQL queries, stored procedures, and data transformations to support analytics and reporting requirements.
  • Implement ETL/ELT processes to move data efficiently across multiple systems and platforms.
  • Perform performance tuning and optimization of Spark jobs, ETL pipelines, SQL queries, and Redshift workloads.
  • Ensure data quality, integrity, security, and governance across data platforms.
  • Troubleshoot production issues and perform root cause analysis for data-related incidents.
  • Collaborate with business stakeholders, analysts, architects, and engineering teams to understand data requirements.
  • Participate in code reviews, technical design discussions, and best-practice implementation.
  • Monitor data pipelines and proactively identify opportunities for performance improvements and automation.
  • Create and maintain technical documentation, data models, and operational procedures.
Required Skills & ExperienceMust-Have Skills
  • 5–8 years of experience in Data Engineering and Data Warehousing.
  • Strong hands-on experience with PySpark.
  • Extensive experience with AWS Glue.
  • Experience building and managing workloads on Amazon EMR.
  • Strong expertise in Amazon Redshift.
  • Excellent SQL development and query optimization skills.
  • Strong understanding of Data Warehousing concepts, dimensional modeling, and ETL/ELT processes.
  • Experience in performance tuning of Spark jobs, SQL queries, ETL pipelines, and data warehouse workloads.
  • Experience handling large-scale datasets and distributed data processing.
  • Strong debugging, troubleshooting, and analytical skills.
Preferred Skills
  • Experience with additional AWS services such as S3, IAM, CloudWatch, Lambda, and Step Functions.
  • Knowledge of CI/CD pipelines and DevOps practices for data platforms.
  • Experience with workflow orchestration tools.
  • Familiarity with data governance, security, and compliance practices.
  • Exposure to Agile/Scrum development methodologies.
Qualifications
  • Bachelor's degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • Relevant AWS certifications will be considered an advantage.
Soft Skills
  • Strong communication and stakeholder management skills.
  • Ability to work independently in a remote environment.
  • Excellent problem-solving and analytical thinking abilities.
  • Ability to collaborate effectively with cross-functional and geographically distributed teams.
  • Strong ownership mindset and commitment to delivering high-quality solutions.

Skills Required

  • 5-8 years of experience in Data Engineering and Data Warehousing
  • Hands-on experience with PySpark
  • Extensive experience with AWS Glue
  • Experience building and managing workloads on Amazon EMR
  • Strong expertise in Amazon Redshift
  • Excellent SQL development and query optimization skills
  • Understanding of data warehousing concepts, dimensional modeling, and ETL/ELT processes
  • Experience in performance tuning of Spark jobs, SQL queries, ETL pipelines, and data warehouse workloads
  • Experience handling large-scale datasets and distributed data processing
  • Strong debugging, troubleshooting, and analytical skills
  • Bachelor's degree in Computer Science, Software Engineering, Information Technology, or a related field
  • Experience with AWS services such as S3, IAM, CloudWatch, Lambda, and Step Functions
  • Knowledge of CI/CD pipelines and DevOps practices for data platforms
  • Experience with workflow orchestration tools
  • Familiarity with data governance, security, and compliance practices
  • Exposure to Agile/Scrum development methodologies
  • Relevant AWS certifications
  • Strong communication and stakeholder management skills
  • Ability to work independently in a remote/hybrid environment and follow UAE business hours/holidays
  • Strong ownership mindset and commitment to delivering high-quality solutions
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Andover, MA
324 Employees
Year Founded: 2007

What We Do

NorthBay is an AWS Premier Partner focused on Database & Application migrations, data & analytics, DevOps & DataOps, application modernization and ML/Ai. Our practice areas include big data and analytics, machine learning, artificial intelligence and database migrations.

Similar Jobs

Binance Logo Binance

Senior Back-end Engineer

Blockchain • Fintech • Software • Cryptocurrency • Metaverse
In-Office or Remote
24 Locations
7696 Employees

Binance Logo Binance

Senior Software Engineer

Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Remote or Hybrid
18 Locations
7696 Employees

Binance Logo Binance

Senior Software Engineer

Blockchain • Fintech • Software • Cryptocurrency • Metaverse
In-Office or Remote
24 Locations
7696 Employees

Northbay Logo Northbay

Senior Data Engineer

Software • Database • Analytics
In-Office or Remote
2 Locations
324 Employees
5-7 Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account