Citi is looking for a Senior Big Data Engineer to design, build, and optimize large-scale data pipelines and distributed data systems that power critical business intelligence across the organisation. Based in Pune and operating in a hybrid model, you will work within a high-performing engineering team where your expertise in PySpark, the Hadoop ecosystem, and streaming data platforms will directly shape the reliability and performance of Citi's data infrastructure.
Responsibilities
- Build and maintain scalable data pipelines using PySpark within a Big Data environment to process and transform large volumes of structured and unstructured data.
- Design and develop solutions across the Hadoop ecosystem — including Hive, HDFS, Sqoop, Spark, Impala, and Scala — to enable efficient data ingestion, processing, and storage.
- Develop and manage real-time and batch data workflows using streaming data platforms, ensuring high availability and low-latency data delivery.
- Write complex SQL queries to extract, validate, and analyse data across distributed systems, supporting data-driven decision-making.
- Design and implement data models and data architecture patterns aligned with data warehouse principles, ensuring scalability, accuracy, and consistency.
- Automate pipeline scheduling and orchestration using shell scripting and Autosys, reducing manual intervention and improving operational reliability.
- Independently identify, assess, and resolve technical risks and data issues in a timely manner, maintaining system integrity across the data platform.
Required Qualifications & Skills
- Hands-on expertise in PySpark and Big Data processing, with the ability to build and optimise distributed data workflows at scale.
- Practical knowledge of the Hadoop ecosystem, including Hive, HDFS, Sqoop, Spark, Impala, and Scala, applied in a production environment.
- Proficiency in complex SQL query development for data analysis, transformation, and validation across large datasets.
- Solid understanding of distributed systems architecture and how data flows across interconnected processing layers.
- Demonstrated knowledge of data modelling and data design, including familiarity with data warehouse concepts and dimensional modelling techniques.
- Competence in shell scripting and job scheduling using Autosys or equivalent workflow automation tools.
- Strong analytical and problem-solving ability, with a track record of working independently to diagnose and resolve complex data engineering challenges.
- Clear and effective communication skills, with the ability to articulate technical concepts to both technical and non-technical audiences.
Beneficial Skills & Qualifications
- Familiarity with streaming data platforms such as Apache Kafka or equivalent real-time data processing technologies.
- Exposure to cloud-based Big Data environments and modern data lakehouse architectures.
- Experience working in financial services or regulated industries where data quality and governance are critical.
What We Offer
At Citi, we invest in our people as much as we invest in our technology. Joining our Pune-based engineering team means working on meaningful data problems at global scale, with the flexibility, support, and resources to grow your career in a direction that matters to you.
- Hybrid working model — 3 days in the office and 2 days working remotely, giving you flexibility without sacrificing collaboration.
- Access to continuous learning and development programmes to deepen your technical expertise in Big Data, cloud, and data engineering.
- Exposure to large-scale, complex data systems that power real business outcomes across a global financial institution.
- A collaborative and inclusive team environment where diverse perspectives are valued and technical ownership is encouraged.
- Competitive compensation and a comprehensive benefits package aligned to your experience and contribution.
- Wellbeing and work-life balance support, including programmes designed to help you thrive professionally and personally.
- Global connectivity — collaborate with engineering and data teams across Citi's international network, broadening your perspective and career reach.
Build the data infrastructure that drives intelligent decisions at one of the world's largest financial institutions — apply today and bring your Big Data expertise to Citi.
Job Family Group: Technology
Job Family: Applications Development
Time Type: Full time
Most Relevant Skills: Please see the requirements listed above.
Other Relevant Skills: For complementary skills, please see above and/or contact the recruiter.
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
Skills Required
- 8+ years of experience in Tableau advanced concepts and Tableau development (reports, dashboards, documents)
- Experience designing and developing advanced Tableau dashboards, visualizations, and ad-hoc reports
- Experience with Python and PySpark scripts to automate data processes and perform advanced analytics
- Advanced data manipulation and analysis using complex SQL queries
- Experience with data validation, data migration, and building charts/graphs for visualization
- Expertise in applications programming and adherence to overall architecture and design standards
- Ability to develop standards for coding, testing, debugging, and implementation
- Experience mentoring or coaching mid-level developers and analysts
- Experience adopting AI tools (Devin, Stylus)
- Experience working with Starburst or Druid AI
Citi Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Citi and has not been reviewed or approved by Citi.
- Healthcare Strength: Benefits coverage is positioned as comprehensive, including health, dental, and vision insurance plus on-site clinics, prescription drug support, and disability coverage. Family-building support such as fertility assistance is described as a notable differentiator within the overall package.
- Retirement Support: Retirement benefits are framed as strong, highlighted by a 401(k) with matching and additional plan options like a Roth 401(k). Financial support is reinforced through discounts and broader financial guidance resources tied to the benefits ecosystem.
- Wellbeing & Lifestyle Benefits: Wellbeing support extends beyond insurance through programs like an Employee Assistance Program, counseling/legal resources, and gym or wellness reimbursement. These offerings increase the perceived total rewards value even when cash compensation sentiment varies by role.
Citi Insights
What We Do
Citi's mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. Our core activities are safeguarding assets, lending money, making payments and accessing the capital markets on behalf of our clients. We have 200 years of experience helping our clients meet the world's toughest challenges and embrace its greatest opportunities. We are Citi, the global bank – an institution connecting millions of people across hundreds of countries and cities.