Lead Data Engineer

Posted 11 Hours Ago
Pune, Mahārāshtra
Hybrid
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
The Role
Design, build, and deploy scalable Big Data solutions. Lead data engineering projects and mentor junior engineers while implementing best practices in data governance and analytics.

What We'll Bring:
We are looking for a Lead Data Engineer to join our growing Data Engineering and Analytics Practice and drive the development of our next-generation suite of products and platforms by designing, coding, building, and deploying highly scalable and robust solutions. You will split your time between our Pune office and remote work as part of our ‘flex together’ approach. In this fast-paced role, you will work with business stakeholders to achieve business goals. This exciting role offers a host of development opportunities as part of a growing global business.

What You'll Bring:

Key Responsibilities:
 

  • Design, build, test, and deploy innovative Big Data solutions at scale, including data lakes, data warehouses, and real-time analytics.
  • Extract, clean, transform, and analyze vast amounts of raw data from various data sources.
  • Build robust data pipelines and API integrations with various internal systems.
  • Work across all stages of the data lifecycle, including data ingestion, storage, processing, and visualization.
  • Implement best practices in data governance, security, and compliance across all data analytics processes.
  • Estimate effort, identify risks, and plan execution effectively.
  • Proactively monitor for issues, identify the root causes of systemic problems, and escalate them.
  • Enable data scientists, business, and product partners to fully leverage our platform.
  • Engage with business stakeholders to understand client requirements and build technical solutions and delivery plans.
  • Evaluate and communicate technical risks effectively and ensure assignments are delivered on schedule with desired quality.
  • Provide end-to-end big data solutions and design details to data engineering teams.
  • Demonstrate excellent analytical and problem-solving skills.
  • Exhibit excellent communication skills, with experience communicating with senior business stakeholders.
  • Lead technical delivery on use cases, plan and delegate tasks to junior team members, and oversee work from inception to final product.
     

Skills & Experience:

Essential:

  • Bachelor’s degree in Computer Science, Engineering, Statistics, or a related field.
  • 8+ years of data engineering experience, with at least 3 years in senior roles.
  • 5+ years of experience in Big Data technologies (e.g., Spark, Hive, Hadoop, Databricks).
  • Strong experience designing and implementing data pipelines.
  • Excellent knowledge of data engineering concepts and best practices.
  • Proven ability to lead, mentor, inspire, and support junior team members.
  • Ability to lead technical deliverables autonomously and guide junior data engineers.
  • Strong attention to detail and adherence to best practices.
  • Experience in designing solutions using batch data processing methods, real-time streams, ETL processes, and business intelligence tools.
  • Experience designing logical data models and physical data models, including data warehouse and data mart designs.
  • Strong SQL knowledge and experience (T-SQL, working with SQL Server, SSMS).
  • Advanced proficiency with Apache Spark, including PySpark and SparkSQL, for distributed data processing.
  • Working knowledge of Apache Hive.
  • Proficiency in Python, Pandas, and PySpark (Scala/Java knowledge is desirable).
  • Knowledge of Delta Lake concepts, common data formats, and Lakehouse architecture.
  • Source control with Git.
  • Expertise in designing and implementing scalable data pipelines and ETL processes using the GCP data stack, including BigQuery, Dataflow, Pub/Sub, Cloud Storage, Cloud Composer, Cloud Functions, and Dataproc (Spark).
  • Expertise in building and managing ETL workflows using Apache Airflow, including DAG creation, scheduling, and error handling.
  • Knowledge of CI/CD concepts and experience designing CI/CD for data pipelines.
  • Knowledge of software engineering principles, including:
    • Object-oriented programming (OOP) principles.
    • Design patterns and their application in data engineering.
    • Software development lifecycle (SDLC).
    • Agile methodologies and practices.
    • Unit testing, integration testing, and test-driven development (TDD).
    • Performance optimization and scalability considerations.
       

Desirable:

  • Experience with streaming services such as Kafka is a plus.
  • R & Sparklyr experience is a plus.
  • Knowledge of MLOps concepts, AI/ML lifecycle management, and MLflow.
  • Expertise in writing complex, highly optimized queries across large data sets to build data pipelines and data processing layers.
  • Jenkins experience is a plus.
  • Relevant certifications (e.g., Google Cloud Professional Data Engineer).
 

TransUnion – a place to grow:

We know that it is unrealistic to expect candidates to have each and every aspect of the essential and/or desirable skills listed above – if there is something you can’t tick off right now – good, you can learn here!

Impact You'll Make:

Enable decision-making across the organization by fostering a data-driven culture.

This is a hybrid position and involves regular performance of job responsibilities virtually as well as in-person at an assigned TU office location for a minimum of two days a week.

TransUnion Job Title

Specialist IV, Data Science and Analytics

Top Skills

Apache Airflow
BigQuery
Cloud Composer
Cloud Functions
Cloud Storage
Databricks
Dataflow
Dataproc
GCP
Git
Hadoop
Hive
Jenkins
Pandas
Pub/Sub
PySpark
Python
Spark
SQL

The Company
HQ: Chicago, IL
13,000 Employees
Year Founded: 1968

What We Do

TransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace.

We do this by having an accurate and comprehensive picture of each person.

This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world.

Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve.

We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®—it’s our purpose, and what drives us every day.

Why Work With Us

Our culture is welcoming, energetic and innovative. There’s an overall synergy that flows throughout TransUnion, creating a sense of unity in knowing that we’re all working to achieve the same overall goal. We’re dedicated to providing opportunities for our people to get involved and stay connected with their colleagues across the globe.

TransUnion Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
HQ: Chicago, IL
MX
Amsterdam, NL
Bengaluru, IN
Boca Raton, FL
Bogotá, Colombia
Burlington, ON
South Africa
Cerqueira César, Sao Paulo
Chennai, IN
Cherry Hill, NJ
Cork, County Cork
Crum Lynne, PA
Denver, CO
Greenwood Village, CO
Guaynabo, PR
Gurugram, IN
Hamburg, DE
Hyderabad, IN
Johannesburg, ZA
TransUnion UK Head Office
London, GB
Louisville, KY
Madrid, ES
Makati, PH
Mumbai, IN
New York, NY
Pune, IN
Reston, VA
San Luis Obispo, CA
Santiago, CL
Sydney, NSW
Toronto, ON
Ulloa, La Aurora
Washington, US
White Plains, NY