Senior Data Scientist - International eKYC, Identity Graph

Posted 23 Days Ago
Hiring Remotely in United States
Remote
170K-200K Annually
Senior level
Artificial Intelligence • Machine Learning • Software • Analytics
Our mission is to verify 100% of good identities in real-time and completely eliminate identity fraud on the internet.
The Role
Lead the design and deployment of machine learning and graph-based systems for international identity verification, ensuring compliance and addressing regional challenges.
Summary Generated by Built In
Why Socure?

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.

We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.

About the Role

The Big Data R&D team builds the core entity‑resolution and graph‑based intelligence that underpins Socure’s Verify and KYC products. As a Senior Data Scientist focused on international eKYC, you will be a technical leader driving the next generation of global identity verification solutions. You will design and deploy ML and graph-based systems tailored to diverse international markets, regulations, and data ecosystems—covering government IDs, telco and credit bureaus, mobile-first data, and non‑traditional signals.

You will own complex, cross‑product initiatives such as international identity graph evolution, probabilistic matching for non‑US identities, and scalable evaluation frameworks that account for regional regulatory and fairness constraints. You will closely partner with Product, Engineering, Compliance, and GTM teams to launch and scale eKYC solutions across multiple countries and regions.

What You'll Do

International eKYC Modeling & Entity Resolution

  • Lead the design, development, and deployment of ML and graph-based algorithms for international entity resolution, identity trust scoring, and anomaly detection across heterogeneous, country‑specific datasets.

  • Architect reusable matching and linking frameworks that work across multiple ID schemes (e.g., national ID numbers, passports, voter IDs, mobile accounts, bank accounts) and local name/address conventions.

  • Develop probabilistic and rule‑augmented models that handle noisy, sparse, or partially labeled international data while maintaining explainability and regulatory defensibility.

Global Identity Graph & Data Quality

  • Define and evolve the international extension of Socure’s identity graph: schema design, linkage strategies, quality tiers, and confidence scoring that can be leveraged by multiple products (Verify, KYC, watchlists, fraud).

  • Design and implement robust data quality and monitoring frameworks for international identity data (coverage, stability, drift, regional bias, label quality) and integrate them into modeling and production monitoring workflows.

  • Build scalable approaches for handling linguistic and cultural variation (e.g., transliteration, multi‑script names, address normalization, local naming patterns) in the identity graph and matching pipelines.

Evaluation, Experimentation, and Model Governance

  • Own experimentation strategy for major international eKYC initiatives:

  • Design offline evaluations and online A/B tests that reflect local ground truth constraints and data sparsity.

  • Define success metrics that balance approval rates, fraud capture, and regulatory/operational constraints per market.

  • Analyze lift, stability, and fairness trade‑offs and drive go/no‑go decisions with Product and Engineering.

  • Define and maintain evaluation frameworks specific to international eKYC (e.g., regional coverage maps, cross‑border identity leakage, local demographic impact, regulatory thresholds).

  • Contribute to model governance documentation and support responses to regulators and large enterprise customers regarding model logic, data provenance, fairness, and monitoring for international markets.

Data Source Strategy & Vendor Evaluation (International)

  • Lead the evaluation and integration of international data vendors (e.g., bureaus, telcos, public records, alternative data):

  • Design benchmarking methodologies for signal quality, incremental value, stability, and fairness by country/segment.

  • Quantify ROI and trade‑offs across multiple vendors and data types; provide clear recommendations that influence product and commercial decisions.

  • Partner with Data Acquisition, Legal, and Compliance to ensure that data usage and modeling approaches meet regional regulatory requirements (e.g., GDPR and local privacy/AML/KYC rules).

Technical Leadership & Cross‑Functional Partnership

  • Collaborate with engineering leaders to design scalable, reliable international data and model pipelines using Spark/PySpark, AWS (EMR, S3, SageMaker, Neptune), and modern MLOps workflows.

  • Act as a subject‑matter expert on international identity, eKYC regulations, and cross‑border data limitations for internal stakeholders, supporting complex customer questions and strategic roadmap discussions.

  • Mentor Data Scientists and Senior Data Scientists on best practices for international modeling: handling low‑label regimes, domain adaptation, localization of thresholds/logic, and building reusable abstractions instead of one‑off country fixes.

  • Communicate strategy, progress, and results to senior leadership and cross‑functional partners through clear documents and presentations, framing complex technical work in terms of business impact, regional risk, and regulatory trade‑offs.

What You Bring

Education & Experience

  • Master’s or Ph.D. in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field, or equivalent practical experience.

  • 6+ years of hands-on applied ML / data science experience (4+ with Ph.D.), including owning production models and pipelines in high‑stakes domains (fraud, risk, identity, payments, credit, or similar).

  • Significant prior work on international or multi‑region products is strongly preferred (e.g., cross‑country KYC, credit risk, payments, or compliance systems).

Technical Skills

  • Expert‑level proficiency in Python and SQL, with extensive experience in distributed data processing (Spark/PySpark, Databricks or similar) on very large datasets.

  • Deep experience designing, training, and deploying models for classification, ranking, anomaly detection, and/or graph learning, including:

  • Feature engineering for noisy/heterogeneous identity data.

  • Robust evaluation under label sparsity and feedback delays.

  • Calibration and thresholding tailored to regional risk and regulatory constraints.

  • Proven expertise with graph technologies (e.g., Neo4j, AWS Neptune, GraphFrames, DGL, PyTorch Geometric) and graph algorithms (entity resolution, link prediction, community detection, label propagation) at scale.

Please note that sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.

Follow Us!

YouTube | LinkedIn | X (Twitter) | Facebook

Skills Required

  • Master's or Ph.D. in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or related field
  • 6+ years of hands-on applied ML/data science experience
  • Significant prior work on international or multi-region products preferred
  • Expert-level proficiency in Python and SQL
  • Deep experience in distributed data processing
  • Proven expertise with graph technologies and algorithms
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chennai, Tamil Nadu
386 Employees
Year Founded: 2012

What We Do

Socure is the leading platform for digital identity trust. Its predictive analytics platform applies artificial intelligence and machine learning techniques with trusted online/offline data intelligence from email, phone, address, IP, device, velocity, and the broader internet to verify identities in real time. The company has more than 750 customers across the financial services, gaming, telecom, and e-commerce industries, including three of the top five banks, seven of the top 10 card issuers, three of the top MSBs, the top payroll provider, the top credit bureau, and over 100 of the largest and most successful FinTechs. Marquee customers include Chime, Varo Money, Public, Stash, and DraftKings. Socure has received numerous industry awards and accolades, including being named to Forbes America’s Best Startup Employers 2021, being awarded Best New Technology Introduced over the Last 12 Months – Data and Data Services at the 2020 American Financial Technology Awards (AFTAs), being ranked number 70 in Deloitte’s Technology Fast 500™, being listed as a Gartner Cool Vendor, being recognized by Forbes as one of the Top 25 Machine Learning Startups to Watch, being named to CB Insights: The FinTech 250, and being awarded Finovate’s Award for Best Use of AI/ML, to name a few.

Why Work With Us

Socure is a critical part of the infrastructure of the digital economy and what we do is critical to ensure the safety of anyone doing any sort of business on the internet. Because of our technology digital identity theft will be eradicated and more people will be included in the digital economy than ever before.

Gallery

Gallery

Similar Jobs

DFIN Logo DFIN

Treasury Associate

Fintech • Software
Remote or Hybrid
United States
1750 Employees
Remote or Hybrid
United States
1750 Employees

Adswerve, Inc. Logo Adswerve, Inc.

Manager, Digital Media Strategy

AdTech • Artificial Intelligence • Cloud • Digital Media • Marketing Tech • Analytics • Consulting
Easy Apply
Remote
United States
250 Employees
115K-135K Annually

Adswerve, Inc. Logo Adswerve, Inc.

Digital Media Strategist

AdTech • Artificial Intelligence • Cloud • Digital Media • Marketing Tech • Analytics • Consulting
Easy Apply
Remote
United States
250 Employees
90K-110K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account