Data Engineer | Hadoop, Spark & Cloud Platforms

Posted Yesterday
Mumbai, Maharashtra, IND
In-Office
Mid level
Fintech • Financial Services
The Role
Design, build, and optimize large-scale ETL pipelines using Hadoop, Spark, and cloud data platforms. Develop batch and real-time workflows, ensure data quality and governance, automate ingestion and deployment, monitor and troubleshoot jobs, and collaborate with analysts and data scientists to support enterprise analytics.
Summary Generated by Built In

Job Summary

Synechron is seeking a skilled ETL Developer with strong expertise in Hadoop ecosystems, Spark, and Informatica to design, develop, and maintain scalable data pipelines supporting enterprise analytics and data warehousing initiatives. This role involves working on large datasets, transforming data, and delivering reliable data integration solutions across on-premise and cloud environments. Your efforts will enable data-driven decision-making, ensure data quality, and support our organization’s strategic focus on scalable and compliant data platforms.

Software Requirements

Required:

  • Hands-on experience with ETL tools: Informatica, Talend, or equivalent (5+ years)

  • Proven expertise in Hadoop ecosystem components: HDFS, Hive, Pig, Sqoop (5+ years)

  • Proficiency in Apache Spark: PySpark, Spark SQL, Spark Streaming

  • Strong programming skills in Python, Java, or Scala for data processing (5+ years)

  • Experience with SQL and relational databases: Oracle, MySQL, PostgreSQL

  • Familiarity with cloud data platforms such as AWS Redshift, Azure Synapse, GCP BigQuery

Preferred:

  • Knowledge of cloud-native data migration and integration tools

  • Exposure to NoSQL databases like DynamoDB or Cassandra

  • Experience with data governance and metadata management tools

Overall Responsibilities

  • Design, develop, and optimize end-to-end ETL pipelines for large-scale data processing and integrations

  • Build and enhance batch and real-time data processing workflows using Spark, Hadoop, and cloud services

  • Convert business and technical requirements into high-performance data solutions aligned with governance standards

  • Tune, debug, and optimize the performance of data workflows and processing jobs

  • Ensure data quality, security, and compliance with enterprise standards and industry regulations

  • Collaborate with data analysts, data scientists, and application teams to maximize data usability and accuracy

  • Automate data ingestion, transformation, and deployment pipelines for operational efficiency

  • Support platform stability by troubleshooting issues, monitoring workflows, and maintaining data lineage

  • Implement and improve data governance, metadata management, and security standards

  • Stay current with emerging data technologies, automation frameworks, and cloud innovations to optimize data architectures

Technical Skills (By Category)

Programming Languages (Essential):

  • Python, Scala, Java (for data processing and automation)

Preferred:

  • Additional scripting or programming skills (Shell, SQL scripting)

Frameworks & Libraries:

  • Spark (PySpark, Spark SQL, Spark Streaming), Hive, Pig

  • Data validation and governance tools (e.g., Atlas, Data Catalogs)

  • AI/ML frameworks such as LangChain, Hugging Face (preferred)

Databases & Storage:

  • Relational: Oracle, PostgreSQL, MySQL

  • NoSQL: DynamoDB, Cassandra (preferred)

Cloud Technologies:

  • AWS: EMR, S3, Glue, CloudFormation, CDK, Redshift (preferred)

  • Azure or GCP data services (desired)

Data Management & Governance:

  • Metadata management, data lineage, data quality frameworks

DevOps & Automation:

  • CI/CD tools: Jenkins, GitHub Actions, TeamCity

  • Infrastructure as Code: Terraform, CloudFormation, Ansible

Experience Requirements

  • 4+ years of experience in designing and developing large-scale data pipelines

  • Proven expertise with Hadoop, Spark, and ETL frameworks in enterprise environments

  • Hands-on experience integrating data within cloud ecosystems and maintaining data quality

  • Familiarity with regulated industries such as finance or banking is preferred

  • Demonstrated ability to troubleshoot performance issues and optimize workflows

Day-to-Day Activities

  • Develop and maintain data pipelines supporting enterprise analytics and reporting

  • Optimize ETL workflows for performance, scalability, and data accuracy

  • Collaborate across teams to understand data requirements and implement technical solutions

  • Automate data processes and manage infrastructure provisioning using IaC tools

  • Monitor data processing jobs, troubleshoot incidents, and perform root cause analysis

  • Maintain documentation for data lineage, workflow configurations, and data security

  • Support migration and platform upgrade projects ensuring minimal disruption

  • Stay updated on new data processing tools, cloud architecture, and compliance standards

Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or related field

  • 4+ years managing large-scale data pipelines, preferably in cloud environments

  • Experience with Hadoop ecosystem, Spark, and ETL tools in enterprise settings

  • Certifications such as AWS Data Analytics, Cloudera, or relevant data platform certifications are advantageous

Professional Competencies

  • Strong analytical and troubleshooting skills in data processing contexts

  • Excellent collaboration and stakeholder management skills

  • Ability to work independently under deadlines and prioritize tasks effectively

  • Continuous learning mindset around emerging data, cloud, and AI/ML technologies

  • Focus on data quality, security, and scalability to meet industry standards

SYNECHRON’S DIVERSITY & INCLUSION STATEMENT

Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture that promotes equality, diversity, and an environment that is respectful to all. As a global company, we strongly believe that a diverse workforce helps build stronger, more successful businesses. We encourage applicants from diverse backgrounds to apply, regardless of race, ethnicity, religion, age, marital status, gender, sexual orientation, or disability. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.

All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.

Skills Required

  • Hands-on experience with ETL tools (Informatica, Talend, or equivalent)
  • Proven expertise in Hadoop ecosystem components (HDFS, Hive, Pig, Sqoop)
  • Proficiency in Apache Spark (PySpark, Spark SQL, Spark Streaming)
  • Strong programming skills in Python, Java, or Scala for data processing
  • Experience with SQL and relational databases (Oracle, MySQL, PostgreSQL)
  • Familiarity with cloud data platforms (AWS Redshift, Azure Synapse, GCP BigQuery)
  • 4+ years designing and developing large-scale data pipelines
  • Experience integrating data within cloud ecosystems and maintaining data quality
  • Bachelor's or Master's degree in Computer Science, Data Engineering, or related field
  • Knowledge of cloud-native data migration and integration tools
  • Exposure to NoSQL databases (DynamoDB, Cassandra)
  • Experience with data governance and metadata management tools
  • Certifications such as AWS Data Analytics or Cloudera

Synechron Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes drawn from responses that popular LLMs generated to common candidate questions about Synechron. It has not been reviewed or approved by Synechron.

  • Fair & Transparent Compensation: Pay is frequently characterized as competitive, particularly relative to large service-consulting peers and in certain in-demand skill areas. Compensation sentiment appears strongest when staffing is stable on strong client engagements and for market-aligned roles in major hubs.
  • Healthcare Strength: Healthcare coverage is often portrayed as a strong point in the U.S., with broad coverage and relatively favorable out-of-pocket experiences. Core medical, dental, and vision options are consistently described as meeting or exceeding a baseline expectation for consulting roles.
  • Equity Value & Accessibility: Equity was made broadly accessible through a company-wide RSU grant tied to a major revenue milestone. This is positioned as a notable upside even if it is framed as a one-time recognition event rather than an ongoing program.

The Company
Maharashtra
12,827 Employees
Year Founded: 2001

What We Do

At Synechron, we believe in the power of digital to transform businesses for the better. Our global consulting firm combines creativity and innovative technology to deliver industry-leading digital solutions. Synechron’s progressive technologies and optimization strategies span end-to-end Artificial Intelligence, Consulting, Digital, Cloud & DevOps, Data, and Software Engineering, servicing an array of noteworthy financial services and technology firms. Through research and development initiatives in our FinLabs we develop solutions for modernization, from Artificial Intelligence and Blockchain to Data Science models, Digital Underwriting, mobile-first applications and more. Over the last 20+ years, our company has been honored with multiple employer awards, recognizing our commitment to our talented teams. With top clients to boast about, Synechron has a global workforce of 14,700+, and has 48 offices in 19 countries within key global markets. For more information on the company, please visit our website: www.synechron.com.
