Sr. Data Engineer (Pyspark, SQL, ETL)

Posted Yesterday
Be an Early Applicant
Pune, Mahārāshtra, IND
In-Office
Senior level
Business Intelligence
The Role
Design, build, and maintain large-scale PySpark/Spark data pipelines and ETL/ELT workflows. Optimize SQL queries, ensure data quality and governance, collaborate with architects, data scientists, and analysts, monitor production workflows, conduct code and architecture reviews, and mentor junior engineers.
Summary Generated by Built In

Sr. Data Engineer - PySpark, ETL, SQL

Aligned Automation Based in Pune, Maharashtra, India office.

About the job

A ‘Better Together’ philosophy towards building a better world

Aligned Automation is a strategic service provider that partners with Fortune 500 leaders to digitize enterprise operations and enable business strategies. We believe we can create positive, lasting change in the way our clients work while advancing the global impact of their business solutions for a more optimistic and better world. We are passionate about building and sustaining an inclusive and equitable workplace where all people can develop and thrive. Enriched by our “4C’s” – Care, Courage, Curiosity, and Collaboration – our culture supports solutions that empower the possible.

Mid-Level Position based out of Pune (8 -12 years)

Job Description:

 

Job Summary:

We are looking for a highly experienced and motivated Senior Data Engineer with a strong background in PySpark, ETL processes, and SQL. The ideal candidate should have deep technical expertise in designing and building scalable data pipelines, data integration, and transformation workflows in distributed environments. This role will play a critical part in enabling data-driven decision-making across the organization.

 

Key Responsibilities:

  • Design, develop, and maintain large-scale, distributed data processing systems using PySpark on big data platforms (Hadoop/Spark).
  • Build and automate ETL/ELT pipelines to extract data from various structured and unstructured sources.
  • Optimize and troubleshoot SQL queries for performance, scalability, and accuracy.
  • Work closely with Data Architects, Data Scientists, and Analysts to deliver clean, structured, and reliable data for business use.
  • Implement best practices for data modeling, data quality, and data governance.
  • Monitor and enhance data workflows to ensure reliability and performance in production.
  • Contribute to technical discussions, architectural reviews, and code reviews.
  • Mentor junior data engineers and support their technical growth.

Required Skills & Qualifications:

  • 8–12 years of experience in Data Engineering roles.
  • Strong expertise in PySpark and Apache Spark for large-scale data processing.
  • Proven experience designing and building ETL pipelines in production environments.
  • Deep understanding and hands-on experience in writing complex and optimized SQL queries.
  • Experience working with big data technologies (e.g., Hadoop, Hive, HDFS, Delta Lake).
  • Familiarity with data warehouse platforms like Snowflake, Redshift, or BigQuery is a plus.
  • Solid understanding of data architecture, data modeling, and data quality frameworks.
  • Experience with cloud platforms (AWS, Azure, or GCP) is preferred.
  • Strong problem-solving and debugging skills.
  • Excellent communication and collaboration skills.

Preferred Qualifications:

  • Experience with workflow orchestration tools like Airflow, Apache NiFi, or similar.
  • Knowledge of DevOps practices and tools for data engineering (CI/CD, Git, Jenkins).
  • Experience with containerization and orchestration tools (Docker, Kubernetes) is a plus.

Education:

  • Bachelor’s or Master’s degree in Computer Science, Information Technology, Engineering, or a related field.


Skills Required

  • 8-12 years of experience in Data Engineering roles
  • Strong expertise in PySpark and Apache Spark for large-scale data processing
  • Proven experience designing and building ETL/ELT pipelines in production
  • Deep understanding and hands-on experience writing complex and optimized SQL queries
  • Experience with big data technologies: Hadoop, Hive, HDFS, Delta Lake
  • Familiarity with data warehouse platforms (Snowflake, Redshift, BigQuery)
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Experience with workflow orchestration tools (Airflow, Apache NiFi)
  • Knowledge of DevOps practices and tools for data engineering (CI/CD, Git, Jenkins)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Solid understanding of data architecture, data modeling, and data quality frameworks
  • Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, or related field
  • Strong problem-solving, debugging, communication, and collaboration skills
  • Experience mentoring junior data engineers
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Irving, TX
344 Employees
Year Founded: 2018

What We Do

Technology, society, economy, policy – all moving at breakneck speed in our 21st century world. You’re feeling the pressure to quickly implement new business models, find new value, make split-second informed decisions and keep one step ahead of customers. How? The answer lies in the ability to make quick, accurate and sustainable business decisions. We believe digital offers a way of doing things better – but the journey to transformation doesn’t have to be painful. At Aligned Automation, we work hard to digitally enable your business strategy – connecting processes, technologies and people to unlock value and drive critical business outcomes.

Similar Jobs

LogicMonitor Logo LogicMonitor

Software Engineer

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software
Easy Apply
Hybrid
2 Locations
1100 Employees
3-3 Annually

Mastercard Logo Mastercard

Specialist, Transaction Services

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
Pune, Mahārāshtra, IND
38800 Employees
50K-70K Annually

Mastercard Logo Mastercard

Lead Product Manager

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
Pune, Mahārāshtra, IND
38800 Employees

Mastercard Logo Mastercard

Senior Specialist, Transaction Services

Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Hybrid
Pune, Mahārāshtra, IND
38800 Employees

Similar Companies Hiring

Energy CX Thumbnail
Greentech • Professional Services • Business Intelligence • Consulting • Energy • Financial Services • Utilities
Chicago, IL
108 Employees
Compa Thumbnail
Artificial Intelligence • HR Tech • Software • Business Intelligence
Irvine, California
75 Employees
Amplify Platform Thumbnail
Fintech • Financial Services • Consulting • Cloud • Business Intelligence • Big Data Analytics
Scottsdale, AZ
62 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account