What We'll Bring:
We are looking for a Lead Data Engineer to join our growing Data Engineering and Analytics Practice and drive the development of a next-generation suite of products and platforms by designing, coding, building, and deploying highly scalable and robust solutions. You will work both from our offices in Pune and remotely as part of our ‘flex together’ approach. In this fast-paced role, you will work with business stakeholders to achieve business goals. This exciting role offers a host of development opportunities as part of a growing global business.
What You'll Bring:
Key Responsibilities:
- Design, build, test, and deploy innovative Big Data solutions at scale, including data lakes, data warehouses, and real-time analytics.
- Extract, clean, transform, and analyze vast amounts of raw data from various data sources.
- Build robust data pipelines and API integrations with various internal systems.
- Work across all stages of the data lifecycle, including data ingestion, storage, processing, and visualization.
- Implement best practices in data governance, security, and compliance across all data analytics processes.
- Estimate effort, identify risks, and plan execution effectively.
- Proactively monitor, identify, and escalate issues or root causes of systemic issues.
- Enable data scientists, business, and product partners to fully leverage our platform.
- Engage with business stakeholders to understand client requirements and build technical solutions and delivery plans.
- Evaluate and communicate technical risks effectively and ensure assignments are delivered on schedule with desired quality.
- Provide end-to-end big data solutions and design details to data engineering teams.
- Demonstrate excellent analytical and problem-solving skills.
- Exhibit excellent communication skills, with experience communicating with senior business stakeholders.
- Lead technical delivery on use cases, plan and delegate tasks to junior team members, and oversee work from inception to final product.
Skills & Experience:
Essential:
- Bachelor’s degree in Computer Science, Engineering, Statistics, or a related field.
- 8+ years of data engineering experience, with at least 3 years in senior roles.
- 5+ years of experience in Big Data technologies (e.g., Spark, Hive, Hadoop, Databricks).
- Strong experience designing and implementing data pipelines.
- Excellent knowledge of data engineering concepts and best practices.
- Proven ability to lead, mentor, inspire, and support junior team members.
- Ability to lead technical deliverables autonomously and guide junior data engineers.
- Strong attention to detail and adherence to best practices.
- Experience in designing solutions using batch data processing methods, real-time streams, ETL processes, and business intelligence tools.
- Experience designing logical data models and physical data models, including data warehouse and data mart designs.
- Strong SQL knowledge and experience (T-SQL, working with SQL Server, SSMS).
- Advanced proficiency with Apache Spark, including PySpark and SparkSQL, for distributed data processing.
- Working knowledge of Apache Hive.
- Proficiency in Python, Pandas, PySpark (Scala/Java knowledge is desirable).
- Knowledge of Delta Lake concepts, common data formats, and Lakehouse architecture.
- Source control with Git.
- Expertise in designing and implementing scalable data pipelines and ETL processes using the GCP data stack, including BigQuery, Dataflow, Pub/Sub, Cloud Storage, Cloud Composer, Cloud Functions, Dataproc (Spark).
- Expertise in building and managing ETL workflows using Apache Airflow, including DAG creation, scheduling, and error handling.
- Knowledge of CI/CD concepts and experience designing CI/CD for data pipelines.
- Software engineering principles, including:
  - Object-oriented programming (OOP) principles.
  - Design patterns and their application in data engineering.
  - Software development lifecycle (SDLC).
  - Agile methodologies and practices.
  - Unit testing, integration testing, and test-driven development (TDD).
  - Performance optimization and scalability considerations.
Desirable:
- Experience with streaming services such as Kafka is a plus.
- R & Sparklyr experience is a plus.
- Knowledge of MLOps concepts, AI/ML lifecycle management, and MLflow.
- Expertise in writing complex, highly optimized queries across large data sets to build data pipelines and data processing layers.
- Jenkins experience is a plus.
- Relevant certifications (e.g., Google Cloud Professional Data Engineer).
Impact You'll Make:
TransUnion – a place to grow:
We know that it is unrealistic to expect candidates to have each and every aspect of the essential and/or desirable skills listed above – if there is something you can’t tick off right now – good, you can learn here!
Impact you will make:
Enable decision-making across the organization by fostering a data-driven culture.
This is a hybrid position and involves regular performance of job responsibilities virtually as well as in-person at an assigned TU office location for a minimum of two days a week.
TransUnion Job Title: Specialist IV, Data Science and Analytics
What We Do
TransUnion is a global information and insights company that makes trust possible by ensuring that each consumer is reliably and safely represented in the marketplace.
We do this by having an accurate and comprehensive picture of each person.
This picture is grounded in our legacy as a credit reporting agency which enables us to tap into both credit and public record data; our data fusion methodology that helps us link, match and tap into the awesome combined power of that data; and our knowledgeable and passionate team, who stewards the information with expertise, and in accordance with local legislation around the world.
Because of our work, organizations can better understand consumers in order to make more informed decisions, and earn their trust through great, personalized experiences, and the proactive extension of the right opportunities, tools and offers. In turn, consumers can be confident that their data identities will result in the opportunities they deserve.
We make trust possible, so businesses and consumers can transact with confidence and achieve great things. We call this Information for Good®—it’s our purpose, and what drives us every day.
Why Work With Us
Our culture is welcoming, energetic and innovative. There’s an overall synergy that flows throughout TransUnion, creating a sense of unity in knowing that we’re all working to achieve the same overall goal. We’re dedicated to providing opportunities for our people to get involved and stay connected with their colleagues across the globe.
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
















