Data Architect (PySpark, Python, Spark, ETL)

Posted 19 Days Ago
Be an Early Applicant
Lahore, Punjab
Senior level
Software • Database • Analytics
The Role
The role involves designing and developing ETL pipelines using Big Data technologies in AWS. Responsibilities include creating complex data pipelines, mentoring teams, and employing AWS services to enhance data management and architecture.
Summary Generated by Built In

NorthBay Solutions, with a team of 400+ professionals, is a US-based premier AWS partner and the only premier-level partner in the region. We have been serving our clients globally and providing AWS Big Data, Data Lake & Data Warehousing solutions for over a decade now. 
We are currently recruiting for a Tech Lead/Architect with solid experience in data engineering and a passion for Big Data. The ideal candidate will have experience of overall 12+ years and 5+ years specifically in data engineering.
Technology Stack Used & Required Knowledge:

  • Must have 12+ years experience of developing and implementing pipelines on AWS cloud that extract, transform, and load data into an information product that helps the organization reach its strategic goals
  • Must have experience on ETL Data Engineer, Python, Bigdata (PySpark, Spark SQL, Hadoop, Hive) and/or Java
  • Experience in creating and driving large-scale ETL pipelines in AWS-based environment
  • Experience with integration of data from multiple data sources.
  • Overall 5+ years of relevant work experience in Big data engineering, ETL, Data Modeling, and Data Architecture.
  • Strong software development and programming skills with a focus on data using Python/ PySpark and/or Scala for data engineering.
  • Experience and understanding of various core AWS services such as IAM, Cloud Formation, EC2, S3, EMR/Spark, Glue, Lambda, Athena, and Redshift will be a plus
  • Experience with the AWS data management tools such as Data lake and Databricks or AWS Snowflake is a plus

To keep it short, below are some key responsibilities:

  • Design and develop using Big Data technologies.
  • Design and develop cutting edge Big Data, and Cloud technologies to build critical, highly complex distributed systems from scratch
  • Design, develop and manage complex ETL jobs and pipelines. 
  • Act as a critical thought leader, consulting and mentoring other teams as your new systems transform the way this entire firm leverages data

Top Skills

Java
Pyspark
Python
Scala
The Company
Andover, MA
324 Employees
On-site Workplace
Year Founded: 2007

What We Do

NorthBay is an AWS Premier Partner focused on Database & Application migrations, data & analytics, DevOps & DataOps, application modernization and ML/Ai.

Our practice areas include big data and analytics, machine learning, artificial intelligence and database migrations.

Similar Jobs

CureMD Logo CureMD

Marketing AI Analyst

Healthtech • Information Technology • Software
Lahore, Punjab, PAK
875 Employees

CureMD Logo CureMD

Data Engineer - ETL

Healthtech • Information Technology • Software
Lahore, Punjab, PAK
875 Employees

US Mobile Logo US Mobile

Product Experience Analyst - Lahore

Internet of Things • Mobile • Other • Software
Lahore, Punjab, PAK
131 Employees

Dubizzle Labs Logo Dubizzle Labs

Systems Analyst

Information Technology • Consulting
Lahore, Punjab, PAK
349 Employees

Similar Companies Hiring

bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account