Data Engineer

Posted 14 Days Ago
Mumbai, Maharashtra, IND
In-Office
Senior level
Artificial Intelligence • Big Data • Information Technology • Software
The Role
Design, build, and optimize high-throughput ETL/ELT pipelines (Glue/PySpark), orchestrate workflows (Airflow/Step Functions), manage and tune multi-terabyte Redshift clusters, lead migrations from Snowflake/RDBMS, integrate heterogeneous sources, implement monitoring and governance, and collaborate with stakeholders to deliver analytics-ready datasets.
Summary Generated by Built In
Job Title: Data Engineer
Location: Mumbai
Experience: 3-5 Years
Employment Type: Full-time
Position Overview:
We are looking for a highly skilled and hands-on Senior Data Engineer to join our growing data engineering practice in Mumbai. This role requires deep technical expertise in building and managing enterprise-grade data pipelines, with a primary focus on Amazon Redshift, AWS Glue, and data orchestration using Airflow or Step Functions. You will be responsible for building scalable, high-performance data workflows that ingest and process multi-terabyte-scale data across complex, concurrent environments.
The ideal candidate thrives on solving performance bottlenecks, has led or participated in data warehouse migrations (e.g., Snowflake to Redshift), and is confident interfacing with business stakeholders to translate requirements into robust data solutions.
Key Responsibilities:
●      Design, develop, and maintain high-throughput ETL/ELT pipelines using AWS Glue (PySpark), orchestrated via Apache Airflow or AWS Step Functions.
●      Own and optimize large-scale Amazon Redshift clusters, and manage high-concurrency workloads for a very large user base.
●      Lead and contribute to migration projects from Snowflake or traditional RDBMS to Redshift, ensuring minimal downtime and robust validation.
●      Integrate and normalize data from heterogeneous sources including REST APIs, AWS Aurora (MySQL/Postgres), streaming inputs, and flat files.
●      Implement intelligent caching strategies, and leverage EC2 and serverless compute (Lambda, Glue) for custom transformations and processing at scale.
●      Write advanced SQL for analytics, data reconciliation, and validation, demonstrating strong SQL development and tuning experience.
●      Implement comprehensive monitoring, alerting, and logging for all data pipelines to ensure reliability, availability, and cost optimization.
●      Collaborate directly with product managers, analysts, and client-facing teams to gather requirements and deliver insights-ready datasets.
●      Champion data governance, security, and lineage, ensuring data is auditable and well-documented across all environments.

Required Qualifications & Experience:
●      3-5 years of core data engineering experience, with a particular focus on hands-on Amazon Redshift performance tuning and large-scale cluster management.
●      Demonstrated experience handling multi-terabyte Redshift clusters, concurrent query loads, and managing complex workload segmentation and queue priorities.
●      Strong experience with AWS Glue (PySpark) for large-scale ETL jobs.
●      Solid understanding and implementation experience of workflow orchestration using Apache Airflow or AWS Step Functions.
●      Strong proficiency in Python, advanced SQL, and data modeling concepts.
●      Familiarity with CI/CD pipelines, Git, DevOps processes, and infrastructure-as-code concepts.

Preferred/Bonus Skills:
●      Experience with Amazon Athena, Lake Formation, or S3-based data lakes.
●      Hands-on participation in Snowflake, BigQuery, or Teradata migration projects.
●      AWS Certifications such as:
○      AWS Certified Data Analytics – Specialty
○      AWS Certified Solutions Architect – Associate/Professional
●      Exposure to real-time streaming architectures or Lambda architectures.

Soft Skills & Expectations:
●      Excellent communication skills: must be able to confidently engage with both technical and non-technical stakeholders, including clients.
●      Strong problem-solving mindset and keen attention to performance, scalability, and reliability.
●      Demonstrated ability to work independently, lead tasks, and take ownership of large-scale systems.
●      Comfortable working in a fast-paced, dynamic, and client-facing environment.

Skills Required

  • 3-5 years of core data engineering experience
  • Hands-on Amazon Redshift performance tuning and large-scale cluster management
  • Experience handling multi-terabyte Redshift clusters and high concurrency workloads
  • Strong experience with AWS Glue (PySpark) for large-scale ETL jobs
  • Workflow orchestration experience with Apache Airflow or AWS Step Functions
  • Proficiency in Python
  • Advanced SQL development and tuning experience
  • Knowledge of data modeling concepts
  • Experience integrating data from REST APIs, Aurora (MySQL/Postgres), streaming, and flat files
  • Familiarity with CI/CD, Git, DevOps processes, and infrastructure-as-code
  • Experience with Snowflake or RDBMS to Redshift migration projects
  • Experience with Amazon Athena, Lake Formation, or S3-based data lakes
  • Hands-on participation in Snowflake, BigQuery, or Teradata migration projects
  • AWS Certifications (Data Analytics Specialty or Solutions Architect)
  • Exposure to real-time streaming architectures or Lambda architectures

The Company
106 Employees
Year Founded: 2016

What We Do

Oneture Technologies is a cloud-first, full-service digital solutions provider specializing in helping enterprises harness the power of automation and data analytics, transforming ideas into business reality by solving technology challenges through innovation.
