Oneture Technologies Jobs

Data Engineer

Oneture Technologies

Data Engineer

Reposted 14 Days Ago

Be an Early Applicant

Mumbai, Maharashtra, IND

In-Office

Senior level

Artificial Intelligence • Big Data • Information Technology • Software

The Role

Design, build, and optimize high-throughput ETL/ELT pipelines (Glue/PySpark), orchestrate workflows (Airflow/Step Functions), manage and tune multi-terabyte Redshift clusters, lead migrations from Snowflake/RDBMS, integrate heterogeneous sources, implement monitoring and governance, and collaborate with stakeholders to deliver analytics-ready datasets.

Summary Generated by Built In

Job Title: Data Engineer

Location: Mumbai

Experience: 3-5 Years

Employment Type: Full-time

Position Overview:

We are looking for a highly skilled and hands-on Senior Data Engineer to join our growing data engineering practice in Mumbai. This role requires deep technical expertise in building and managing enterprise-grade data pipelines, with a primary focus on Amazon Redshift, AWS Glue, and data orchestration using Airflow or Step Functions. You will be responsible for building scalable, high-performance data workflows that ingest and process multi-terabyte-scale data across complex, concurrent environments.

The ideal candidate is someone who thrives in solving performance bottlenecks, has led or participated in data warehouse migrations (e.g., Snowflake to Redshift), and is confident interfacing with business stakeholders to translate requirements into robust data solutions.

Key Responsibilities:

● Design, develop, and maintain high-throughput ETL/ELT pipelines using AWS Glue (PySpark), orchestrated via Apache Airflow or AWS Step Functions.

● Own and optimize large-scale Amazon Redshift clusters and managing high concurrency workloads for very large user base:

● Lead and contribute to migration projects from Snowflake or traditional RDBMS to Redshift, ensuring minimal downtime and robust validation.

● Integrate and normalize data from heterogeneous sources including REST APIs, AWS Aurora (MySQL/Postgres), streaming inputs, and flat files.

● Implement intelligent caching strategies, leverage EC2 and serverless compute (Lambda, Glue) for custom transformations and processing at scale.

● Write advanced SQL for analytics, data reconciliation, and validation, demonstrating strong SQL development and tuning experience.

● Implement comprehensive monitoring, alerting, and logging for all data pipelines to ensure reliability, availability, and cost optimization.

● Collaborate directly with product managers, analysts, and client-facing teams to gather requirements and deliver insights-ready datasets.

● Champion data governance, security, and lineage, ensuring data is auditable and well-documented across all environments.

Required Qualifications & Experience:

● 3-5 years of core data engineering experience, especially focused in Amazon Redshift hands-on performance tuning and large-scale management capacity.

● Demonstrated experience handling multi-terabyte Redshift clusters, concurrent query loads, and managing complex workload segmentation and queue priorities.

● Strong experience with AWS Glue (PySpark) for large-scale ETL jobs.

● Solid understanding and implementation experience of workflow orchestration using Apache Airflow or AWS Step Functions.

● Strong proficiency in Python, advanced SQL, and data modeling concepts.

● Familiarity with CI/CD pipelines, Git, DevOps processes, and infrastructure-as-code concepts.

Preferred/Bonus Skills:

● Experience with Amazon Athena, Lake Formation, or S3-based data lakes.

● Hands-on participation in Snowflake, BigQuery, or Teradata migration projects.

● AWS Certifications such as:

○ AWS Certified Data Analytics – Specialty

○ AWS Certified Solutions Architect – Associate/Professional

● Exposure to real-time streaming architectures or Lambda architectures.

Soft Skills & Expectations:

● Excellent communication skills — must be able to confidently engage with both technical and non-technical stakeholders, including clients.

● Strong problem-solving mindset and a keen attention to performance, scalability, and reliability.

● Demonstrated ability to work independently, lead tasks, and take ownership of large-scale systems.

● Comfortable working in a fast-paced, dynamic, and client-facing environment.

Skills Required

3-5 years of core data engineering experience
Hands-on Amazon Redshift performance tuning and large-scale cluster management
Experience handling multi-terabyte Redshift clusters and high concurrency workloads
Strong experience with AWS Glue (PySpark) for large-scale ETL jobs
Workflow orchestration experience with Apache Airflow or AWS Step Functions
Proficiency in Python
Advanced SQL development and tuning experience
Knowledge of data modeling concepts
Experience integrating data from REST APIs, Aurora (MySQL/Postgres), streaming, and flat files
Familiarity with CI/CD, Git, DevOps processes, and infrastructure-as-code
Experience with Snowflake or RDBMS to Redshift migration projects
Experience with Amazon Athena, Lake Formation, or S3-based data lakes
Hands-on participation in Snowflake, BigQuery, or Teradata migration projects
AWS Certifications (Data Analytics Specialty or Solutions Architect)
Exposure to real-time streaming architectures or Lambda architectures

View all jobs at Oneture Technologies

View Oneture Technologies Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: Mumbai

106 Employees

Year Founded: 2016

What We Do

Oneture Technologies is a cloud-first, full-service digital solutions provider specializing in helping enterprises harness the power of automation and data analytics, transforming ideas into business reality by solving technology challenges through innovation.