Data Scientist II - Big Data R&D, Identity Graph & KYC

Reposted 17 Days Ago
5 Locations
Remote or Hybrid
140K-170K Annually
Mid level
Artificial Intelligence • Machine Learning • Software • Analytics
Our mission is to verify 100% of good identities in real-time and completely eliminate identity fraud on the internet.
The Role
The role involves developing graph algorithms, data pipelines for identity verification, and supporting data analysis for compliance products.
Summary Generated by Built In
Why Socure?

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.

We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.

About the Role

The Big Data R&D team is responsible for building the core identity graph and entity-resolution capabilities that power Socure’s KYC and compliance products. In this role, you will help develop graph-based algorithms and data pipelines on massive PII datasets, support modelers with high-quality features, and evaluate new data sources that feed our identity and fraud products. You will work closely with senior data scientists and engineers while developing your skills in large-scale ML, distributed systems, and graph analytics.

What You'll Do
  • Contribute to the design and implementation of machine learning, data mining, statistical, and graph-based algorithms to analyze very large datasets for identity verification and anomaly detection.

  • Analyze large datasets to help develop and refine entity-resolution and identity-matching algorithms that drive Socure’s KYC and compliance solutions.

  • Build and maintain components of data-processing pipelines (ETL, feature generation, normalization) using tools such as Spark/PySpark and AWS (e.g., EMR, S3).

  • Support senior data scientists with feature engineering, data exploration, error analysis, and A/B test setup for new models and signals.

  • Help evaluate new third‑party and internal data sources: profile data quality, design offline experiments, and summarize impact on coverage and model performance.

  • Implement and maintain SQL and Python/R code for data extraction, transformation, and validation; contribute to code reviews and basic testing.

  • Provide analytical support to compliance and regulatory product teams, including ad hoc investigations, simple dashboards, and data deep dives.

  • Communicate findings in a clear, structured way to peers and cross‑functional partners (Product, Engineering, Client Analysis), focusing on key insights and trade‑offs.

  • Work effectively in a fast‑paced, cross‑functional environment; demonstrate ownership of well-scoped tasks and follow through to completion.

What You Bring
  • Master’s degree with 2+ years of experience, or Ph.D. with 1+ years of experience in a data science or analytics role, or equivalent practical experience.

  • Proficiency in at least one general-purpose programming language used in data science (Python, or Scala).

  • Solid experience writing and optimizing SQL for large datasets; comfort working in data lake / warehouse environments.

  • Hands‑on experience with Spark or PySpark and common ML libraries (e.g., scikit‑learn, XGBoost, TensorFlow/PyTorch a plus).

  • Familiarity with UNIX environments and the AWS ecosystem (e.g., EMR, S3); Databricks experience is a plus.

  • Working knowledge of supervised/unsupervised ML and basic statistics (similarity measures, clustering, evaluation metrics).

  • Exposure to graph techniques or graph databases (Neo4j, AWS Neptune, GraphFrames) is a strong plus.

  • Bonus: experience with Elasticsearch or DynamoDB; workflow tools such as Airflow for automating data pipelines.

  • Ability to break down loosely defined problems, ask good clarifying questions, and iterate quickly with feedback.

Please note that sponsorship is not available at this time; and that you must be located within 45 miles of a talent hub to be considered.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.

Follow Us!

YouTube | LinkedIn | X (Twitter) | Facebook

Skills Required

  • Master's degree with 2+ years experience or Ph.D. with 1+ years experience in data science
  • Proficiency in Python or Scala
  • Experience writing and optimizing SQL for large datasets
  • Hands-on experience with Spark or PySpark and ML libraries
  • Familiarity with UNIX and AWS ecosystem
  • Knowledge of supervised/unsupervised ML and basic statistics
  • Exposure to graph techniques or databases
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chennai, Tamil Nadu
386 Employees
Year Founded: 2012

What We Do

Socure is the leading platform for digital identity trust. Its predictive analytics platform applies artificial intelligence and machine learning techniques with trusted online/offline data intelligence from email, phone, address, IP, device, velocity, and the broader internet to verify identities in real time. The company has more than 750 customers across the financial services, gaming, telecom, and e-commerce industries, including three of the top five banks, seven of the top 10 card issuers, three of the top MSBs, the top payroll provider, the top credit bureau, and over 100 of the largest and most successful FinTechs. Marquee customers include Chime, Varo Money, Public, Stash, and DraftKings. Socure has received numerous industry awards and accolades, including being named to Forbes America’s Best Startup Employers 2021, being awarded Best New Technology Introduced over the Last 12 Months – Data and Data Services at the 2020 American Financial Technology Awards (AFTAs), being ranked number 70 in Deloitte’s Technology Fast 500™, being listed as a Gartner Cool Vendor, being recognized by Forbes as one of the Top 25 Machine Learning Startups to Watch, being named to CB Insights: The FinTech 250, and being awarded Finovate’s Award for Best Use of AI/ML, to name a few.

Why Work With Us

Socure is a critical part of the infrastructure of the digital economy and what we do is critical to ensure the safety of anyone doing any sort of business on the internet. Because of our technology digital identity theft will be eradicated and more people will be included in the digital economy than ever before.

Gallery

Gallery

Similar Jobs

Mondelēz International Logo Mondelēz International

Waste Portfolio Manager

Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Remote or Hybrid
48 Locations
90000 Employees
122K-168K Annually

MetLife Logo MetLife

Group Insurance Administrator

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
46K-46K Annually

Wipfli Logo Wipfli

Audit Manager, Manufacturing Industry

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
Minneapolis, MN, USA
3000 Employees
110K-166K Annually

Wipfli Logo Wipfli

Audit Manager, Tribal Industry

Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
Remote or Hybrid
United States
3000 Employees
97K-145K Annually

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
31 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account