Aligned Automation

Sr. Data Engineer (Pyspark, SQL, ETL)

Reposted 18 Days Ago

Be an Early Applicant

Pune, Mahārāshtra, IND

In-Office

Senior level

Business Intelligence

The Role

Design, build, and maintain large-scale PySpark/Spark data pipelines and ETL/ELT workflows. Optimize SQL queries, ensure data quality and governance, collaborate with architects, data scientists, and analysts, monitor production workflows, conduct code and architecture reviews, and mentor junior engineers.

Summary Generated by Built In

Sr. Data Engineer - PySpark, ETL, SQL

Aligned Automation Based in Pune, Maharashtra, India office.

About the job

A ‘Better Together’ philosophy towards building a better world

Aligned Automation is a strategic service provider that partners with Fortune 500 leaders to digitize enterprise operations and enable business strategies. We believe we can create positive, lasting change in the way our clients work while advancing the global impact of their business solutions for a more optimistic and better world. We are passionate about building and sustaining an inclusive and equitable workplace where all people can develop and thrive. Enriched by our “4C’s” – Care, Courage, Curiosity, and Collaboration – our culture supports solutions that empower the possible.

Mid-Level Position based out of Pune (8 -12 years)

Job Description:

Job Summary:

We are looking for a highly experienced and motivated Senior Data Engineer with a strong background in PySpark, ETL processes, and SQL. The ideal candidate should have deep technical expertise in designing and building scalable data pipelines, data integration, and transformation workflows in distributed environments. This role will play a critical part in enabling data-driven decision-making across the organization.

Key Responsibilities:

Design, develop, and maintain large-scale, distributed data processing systems using PySpark on big data platforms (Hadoop/Spark).
Build and automate ETL/ELT pipelines to extract data from various structured and unstructured sources.
Optimize and troubleshoot SQL queries for performance, scalability, and accuracy.
Work closely with Data Architects, Data Scientists, and Analysts to deliver clean, structured, and reliable data for business use.
Implement best practices for data modeling, data quality, and data governance.
Monitor and enhance data workflows to ensure reliability and performance in production.
Contribute to technical discussions, architectural reviews, and code reviews.
Mentor junior data engineers and support their technical growth.

Required Skills & Qualifications:

8–12 years of experience in Data Engineering roles.
Strong expertise in PySpark and Apache Spark for large-scale data processing.
Proven experience designing and building ETL pipelines in production environments.
Deep understanding and hands-on experience in writing complex and optimized SQL queries.
Experience working with big data technologies (e.g., Hadoop, Hive, HDFS, Delta Lake).
Familiarity with data warehouse platforms like Snowflake, Redshift, or BigQuery is a plus.
Solid understanding of data architecture, data modeling, and data quality frameworks.
Experience with cloud platforms (AWS, Azure, or GCP) is preferred.
Strong problem-solving and debugging skills.
Excellent communication and collaboration skills.

Preferred Qualifications:

Experience with workflow orchestration tools like Airflow, Apache NiFi, or similar.
Knowledge of DevOps practices and tools for data engineering (CI/CD, Git, Jenkins).
Experience with containerization and orchestration tools (Docker, Kubernetes) is a plus.

Education:

Bachelor’s or Master’s degree in Computer Science, Information Technology, Engineering, or a related field.

Skills Required

8-12 years of experience in Data Engineering roles
Strong expertise in PySpark and Apache Spark for large-scale data processing
Proven experience designing and building ETL/ELT pipelines in production
Deep understanding and hands-on experience writing complex and optimized SQL queries
Experience with big data technologies: Hadoop, Hive, HDFS, Delta Lake
Familiarity with data warehouse platforms (Snowflake, Redshift, BigQuery)
Experience with cloud platforms (AWS, Azure, or GCP)
Experience with workflow orchestration tools (Airflow, Apache NiFi)
Knowledge of DevOps practices and tools for data engineering (CI/CD, Git, Jenkins)
Experience with containerization and orchestration (Docker, Kubernetes)
Solid understanding of data architecture, data modeling, and data quality frameworks
Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, or related field
Strong problem-solving, debugging, communication, and collaboration skills
Experience mentoring junior data engineers

View all jobs at Aligned Automation

View Aligned Automation Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: Irving, TX

344 Employees

Year Founded: 2018

What We Do

Technology, society, economy, policy – all moving at breakneck speed in our 21st century world. You’re feeling the pressure to quickly implement new business models, find new value, make split-second informed decisions and keep one step ahead of customers. How? The answer lies in the ability to make quick, accurate and sustainable business decisions. We believe digital offers a way of doing things better – but the journey to transformation doesn’t have to be painful. At Aligned Automation, we work hard to digitally enable your business strategy – connecting processes, technologies and people to unlock value and drive critical business outcomes.