Lead Python Full Stack Data Engineer - Vice President

Reposted 10 Days Ago
Be an Early Applicant
Mississauga, ON, CAN
In-Office
121K-171K Annually
Senior level
Fintech • Financial Services
The Role
Lead Full Stack Data Engineer responsible for architecting, developing, and optimizing data solutions, mentoring teams, and driving AI integration in data processes.
Summary Generated by Built In

We are assembling an A-team of highly skilled, autonomous, and visionary engineers, and we are seeking an exceptional Lead Full Stack Data Engineer to join our high-performing, co-located squads in Canada. This senior role is for a hands-on player/coach who not only masters the full spectrum of data engineering but also demonstrates exemplary leadership, strategic thinking, and an unwavering commitment to leveraging AI for transformative productivity. The ideal candidate will take ownership of complex data products and platforms, driving the design, development, and optimization of end-to-end data solutions from ingestion to advanced consumption. We are looking for a true AI-first thinker who can architect scalable systems, mentor emerging talent, profoundly understand the functional domains our work impacts, and significantly contribute to our data strategy and culture.

Responsibilities:

  • Lead and Architect end-to-end data solutions, providing technical direction and architectural oversight for complex data pipelines and platforms, ensuring robust performance, scalability, data quality, security, and compliance.
  • Drive Strategic Initiatives within small, co-located squads (4-7 person teams), fostering an environment of high communication, minimal coordination overhead, and collective ownership to deliver impactful data products.
  • Act as a Player/Coach, leading by example in hands-on development while actively mentoring and elevating the technical capabilities of junior and mid-level engineers, cultivating a culture of technical excellence and innovation.
  • Design, Develop, and Optimize highly efficient and resilient data ingestion, processing, and transformation pipelines using advanced Python and PySpark techniques for petabyte-scale datasets.
  • Architect and Implement sophisticated data storage solutions leveraging a diverse set of big data technologies including Hive, distributed file systems (e.g., HDFS, S3), and enterprise-grade NoSQL databases (e.g., Cassandra, MongoDB).
  • Champion Data Modeling and Governance, designing scalable data models and schemas that support advanced analytics, machine learning, and critical reporting needs, ensuring data integrity, accessibility, and discoverability.
  • Strategically Engage with data consumers, data scientists, and business stakeholders to deeply understand their requirements, translating them into robust data solutions and providing expert guidance on data utilization and interpretation.
  • Lead the Implementation of real-time data streaming and complex event-driven architectures using technologies like Apache Kafka, ensuring low-latency data availability for critical business functions.
  • Enforce and Evolve Best Practices in data engineering and software development, spearheading rigorous code reviews, comprehensive automated testing strategies, and robust CI/CD pipelines within a DevOps culture.
  • Exhibit High Autonomy and Agency, taking full ownership of technical challenges, making well-reasoned architectural decisions, and proactively identifying and implementing continuous improvements across the data landscape.
  • Innovate with AI-Powered Development, actively leveraging, integrating, and contributing to AI coding tools (e.g., internal Citi AI tools, Copilot, Claude Code, Codex, Antigravity) to set new benchmarks for productivity, code quality, and development velocity, and inspiring others to do the same.
  • Shape the Future of Our Data Stack, actively participating in technical discussions, evaluating new technologies, and making strategic recommendations that align with business objectives and architectural vision.
  • Expertly Troubleshoot and Resolve the most challenging technical issues within complex, distributed big data environments, applying advanced analytical and problem-solving methodologies.

Required Skills & Experience:

  • Experience: 6+ years of progressive, hands-on experience as a Senior/Lead Data Engineer, with a proven track record of architecting and delivering complex, large-scale data solutions, and operating effectively as a player/coach.
  • Programming Languages:
    • Expert-level proficiency in Python, with deep expertise in developing highly optimized, scalable, and production-grade PySpark applications for mission-critical data processing.
  • Big Data Frameworks/Technologies:
    • Deep architectural understanding and extensive hands-on experience with the entire Apache Spark ecosystem (Spark Core, Spark SQL, Spark Streaming, Spark MLlib).
    • Advanced proficiency with Hive for enterprise data warehousing, including optimization techniques for large and complex queries.
    • Expert knowledge of distributed computing fundamentals, HDFS, and other components of the Hadoop ecosystem.
  • Data Storage & Management:
    • Master-level proficiency in SQL, complex query optimization, and advanced data warehousing concepts (e.g., dimensional modeling, data vault, data lakes).
    • Extensive experience with various data storage formats (e.g., Parquet, ORC, Avro) and leading data lake solutions (e.g., Delta Lake, Iceberg).
    • Proven experience with enterprise-grade NoSQL databases (e.g., Cassandra, MongoDB, HBase) and understanding of their architectural trade-offs.
  • Messaging & Event Streaming:
    • Expert-level experience with Apache Kafka, including design and implementation of high-throughput, low-latency real-time data pipelines and event-driven microservices architectures.
  • Cloud Platforms:
    • Extensive experience with big data services on major cloud platforms (e.g., AWS EMR/Glue/Redshift/Kinesis, Azure Databricks/Data Factory/Synapse/Event Hubs, GCP Dataflow/Dataproc/BigQuery/Pub/Sub), including cloud-native architectural patterns.
  • AI-Powered Development & Productivity:
    • Mandatory: Demonstrated mastery and innovative application of AI coding tools (e.g., Claude Code, Codex, Antigravity) to significantly enhance the development lifecycle.
    • A proactive, "AI-first thinker" mindset, with a proven ability to evaluate, integrate, and evangelize new AI tools and methodologies within the team to drive continuous improvement and innovation.
  • Domain Understanding:
    • Expert ability to articulate the intricacies of the functional domain, proactively identifying business challenges and opportunities, and translating them into impactful, data-driven solutions.
  • Leadership & Mentoring:
    • Proven ability to lead technical discussions, mentor team members, and foster a collaborative and high-performing engineering culture.
  • Other Essential Skills:
    • Advanced understanding of software engineering principles, design patterns, data structures, algorithms, and performance engineering for distributed systems.
    • Extensive experience with RESTful API design, development, and integration for data services.
    • Strong expertise in containerization technologies (e.g., Docker, Kubernetes) and orchestration for deploying and managing scalable data applications.
    • Master-level proficiency with version control systems, especially Git, including advanced branching, merging, and code review strategies.
    • Exceptional problem-solving, analytical, and debugging skills applied to highly complex, distributed big data ecosystems.
    • Superior communication, presentation, and interpersonal skills, with the ability to articulate complex technical concepts to diverse audiences and influence strategic decisions.
    • Demonstrated highest levels of autonomy and agency in driving strategic initiatives and delivering impactful, innovative data solutions.

Education:

  • Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a relative experience is required. A Master's degree or Ph.D. is strongly preferred. Equivalent advanced practical experience with a demonstrable track record of architecting and leading major data initiatives will also be considered.

------------------------------------------------------

Job Family Group: Technology

------------------------------------------------------

Job Family:Applications Development

------------------------------------------------------

Time Type:Full time

------------------------------------------------------

Primary Location Full Time Salary Range:$120,800.00 - $170,800.00

------------------------------------------------------

Most Relevant Skills Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Automated Processing and AI

We use automated processing, including artificial intelligence, for our legitimate business interests (or our reasonable and appropriate business purposes) to identify and align the candidate's skills and abilities with a specific job opening. Additionally, if you so choose, or consent, we can match your skills and abilities to other suitable roles at Citi.

Importantly, all our hiring processes and decisions, including determining your suitability for a role, are conducted, checked, and decided by individuals. Our automated processing and AI do not involve relying on automatic or autonomous decision-making. Please refer to any Jurisdictional Considerations, with specific provisions for your country (where relevant) for further details.

------------------------------------------------------

This job opening is for an existing job vacancy.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

 

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Skills Required

  • 6+ years of hands-on experience as a Senior/Lead Data Engineer
  • Expert-level proficiency in Python and PySpark
  • Deep understanding of Apache Spark ecosystem
  • Master-level proficiency in SQL and data warehousing concepts
  • Extensive experience with NoSQL databases like Cassandra and MongoDB
  • Expert-level experience with Apache Kafka for real-time data pipelines
  • Experience with big data services on cloud platforms (AWS, Azure, GCP)
  • Demonstrated mastery of AI coding tools

Citi Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Citi and has not been reviewed or approved by Citi.

  • Healthcare Strength Benefits coverage is positioned as comprehensive, including health, dental, and vision insurance plus on-site clinics, prescription drug support, and disability coverage. Family-building support such as fertility assistance is described as a notable differentiator within the overall package.
  • Retirement Support Retirement benefits are framed as strong, highlighted by a 401(k) with matching and additional plan options like a Roth 401(k). Financial support is reinforced through discounts and broader financial guidance resources tied to the benefits ecosystem.
  • Wellbeing & Lifestyle Benefits Wellbeing support extends beyond insurance through programs like an Employee Assistance Program, counseling/legal resources, and gym or wellness reimbursement. These offerings increase the perceived total rewards value even when cash compensation sentiment varies by role.

Citi Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Kwun Tong, Kowloon
223,850 Employees

What We Do

Citi's mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. Our core activities are safeguarding assets, lending money, making payments and accessing the capital markets on behalf of our clients. We have 200 years of experience helping our clients meet the world's toughest challenges and embrace its greatest opportunities. We are Citi, the global bank – an institution connecting millions of people across hundreds of countries and cities.

Similar Jobs

WorkWhile Logo WorkWhile

Senior Software Engineer

Artificial Intelligence • HR Tech • Information Technology • Machine Learning • Software • App development • Industrial
Hybrid
4 Locations
100 Employees
150K-200K Annually

Capco Logo Capco

Database Administrator

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Hybrid
Toronto, ON, CAN
6000 Employees
113K-145K Annually

Toast Logo Toast

Software Engineer

Cloud • Fintech • Food • Information Technology • Software • Hospitality
In-Office
Toronto, ON, CAN
5000 Employees
100K-147K Annually

Toast Logo Toast

Senior Software Engineer

Cloud • Fintech • Food • Information Technology • Software • Hospitality
In-Office
Toronto, ON, CAN
5000 Employees
125K-175K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account