Lead ML Data Engineer, AI Core

Posted 5 Days Ago
4 Locations
In-Office
Senior level
Financial Services
The Role
Lead the design and build of scalable data ingestion and feature pipelines for foundation models; implement data quality monitoring; model and integrate new data sources; run experiments measuring data impact; optimize ML training and hyperparameters; collaborate with ML, platform, and infra teams; lead technical initiatives and mentor team members.
Summary Generated by Built In
About Us

Nu is one of the largest digital financial platforms in the world, with more than 127 million customers across Brazil, Mexico, and Colombia. Guided by our mission to fight complexity and empower people, we are redefining financial services in Latin America, and this is still just the beginning of the purple future we're building.

Listed on the New York Stock Exchange (NYSE: NU), we combine proprietary technology, data intelligence, and an efficient operating model to deliver financial products that are simple, accessible, and human.

Our impact has been recognized by global rankings such as Time 100 Companies, Fast Company's Most Innovative Companies, and Forbes World's Best Bank. Visit our institutional page: [Careers at Nu - Join our team!](https://international.nubank.com.br/careers/)

About the Role

At Nu, data is the foundation that powers our AI and machine learning models, enabling millions of customers to access fair financial products. As a Lead Machine Learning Engineer in AI Core Data Intelligence, you'll work across a broad spectrum, from building scalable data infrastructure and feature pipelines that feed our state-of-the-art foundation models to designing, training, and shipping transaction classification models that power critical customer experiences across the company.

You'll work at the intersection of data and applied machine learning, contributing across multiple stages of the ML lifecycle: ingesting and labeling data, training and evaluating models, and helping with deployment and production monitoring through robust quality controls. You’ll partner closely with product, compliance, and ML teams to ensure models are auditable, privacy-aware, and deliver measurable business value.

You'll join a team that manages the data engineering backbone of AI Core, ensuring data is accessible, healthy, and properly tracked across our entire ML ecosystem. Here, you'll combine your expertise in building scalable data systems with your passion for machine learning, creating solutions that enable our models to learn from better, richer data.

You can read more about the work in the AI Core team on our blog: https://building.nubank.com/understanding-our-customers-finances-through-foundation-models/

Key Responsibilities

As a Lead Machine Learning Engineer in AI Core Data Intelligence, you will:

  • Design and build scalable data ingestion pipelines that bring new datasets into our AI Core platform, ensuring reliable, efficient data flow from source to model training.
  • Implement data quality monitoring and validation systems that catch issues before they impact model performance, maintaining the health of datasets across our ML ecosystem.
  • Model new types of data for integration into our foundation models.
  • Analyze the impact of new data sources on existing models, conducting experiments to measure performance improvements and guide data acquisition decisions.
  • Develop and maintain data preparation workflows that transform raw data into features ready for model training, working with distributed computing frameworks like Ray.
  • Tune and optimize machine learning models when new datasets are integrated, applying hyperparameter optimization and evaluating model performance improvements.
  • Collaborate with AI Core ML, Platform, and Infra teams to ensure seamless data flow across our ML infrastructure, from ingestion to model deployment.
  • Lead technical initiatives that improve our data engineering practices, setting standards for data quality, pipeline reliability, and model-data integration.
  • Mentor team members and contribute to hiring activities, helping build a strong and diverse team that drives innovation in AI infrastructure.
Basic Qualifications
  • Typically 6+ years of experience in machine learning engineering, data engineering, or related fields with a strong track record of building production data and ML systems.
  • Proven experience designing and building data ingestion pipelines at scale, with expertise in distributed computing frameworks (Ray, Spark, or similar).
  • Strong background in applied machine learning, including model training, hyperparameter tuning, and performance evaluation.
  • Experience analyzing how data changes impact model performance, with the ability to design and run experiments to measure improvements.
  • Proficiency in Python for data engineering and ML workflows, with experience working with large-scale data processing systems.
  • Solid understanding of data quality principles and experience implementing monitoring, validation, and alerting systems.
  • Strong problem-solving skills with the ability to address complex, ambiguous problems requiring coordination across multiple teams.
  • Excellent communication skills, capable of explaining technical concepts to both technical and non-technical stakeholders.
  • Demonstrated leadership experience, including mentoring team members and contributing to technical decision-making.
Preferred Qualifications
  • Experience with MLflow or similar model tracking and versioning systems.
  • Knowledge of foundation models, fine-tuning workflows, and transformer architectures.
  • Experience with data pipeline orchestration tools (Dagster, Airflow, or similar).
  • Background in financial services or fintech, understanding the unique data challenges in this domain.
  • Experience working in a fast-paced, high-growth environment with distributed teams.
  • Track record of reducing complexity in data systems and improving developer experience for ML teams.
Our Benefits
  • Opportunity to earn equity at Nu
  • Medical Insurance
  • Dental and Vision Insurance
  • Life Insurance and AD&D
  • Extended maternity and paternity leave
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • 401(k)
  • Savings Plans - Health Savings Account and Flexible Spending Account
  • Work-from-home Allowance
  • Relocation Assistance Package, if applicable.
Work Model for this Role

Hybrid 2-3 times/week: Our hybrid work model brings us to the office at least twice a week, on strategic days designed to maximize team connection and collaboration. For more details, visit https://building.nubank.com/nu-hybrid-work-model/

Locations: This role is available in any of our North American offices (Palo Alto, USA; Miami, USA; Durham, USA; Toronto, CAN).

Top Skills

Python, Ray, Spark, MLflow, Dagster, Airflow, Transformer Architectures, Foundation Models

The Company
HQ: São Paulo, São Paulo
13,649 Employees
Year Founded: 2013

What We Do

Nu was born in 2013 with the mission to fight complexity to empower people in their daily lives by reinventing financial services.

We are one of the world’s largest digital banking platforms, serving more than 70 million customers across Brazil, Mexico, and Colombia.

As one of the leading technology companies in the world, Nu leverages proprietary technologies and innovative business practices to create new financial solutions and experiences for individuals and SMEs that are simple, intuitive, convenient, low-cost, empowering, and human.

Guided by its mission, Nu is fostering access to financial services across Latin America.
