Staff Data Scientist - Identity Graph

Reposted 6 Days Ago
Hiring Remotely in USA
Remote
170K-205K Annually
Senior level
Artificial Intelligence • Machine Learning • Software • Analytics
Our mission is to verify 100% of good identities in real-time and completely eliminate identity fraud on the internet.
The Role
Lead advanced data science efforts for identity graph modeling. Collaborate across teams to enhance system performance and data quality, focusing on scalable identity solutions.
Why Socure?

Socure is building the identity trust infrastructure for the digital economy — verifying 100% of good identities in real time and stopping fraud before it starts. The mission is big, the problems are complex, and the impact is felt by businesses, governments, and millions of people every day.

We hire people who want that level of responsibility. People who move fast, think critically, act like owners, and care deeply about solving customer problems with precision. If you want predictability or narrow scope, this won’t be your place. If you want to help build the future of identity with a team that holds a high bar for itself — keep reading.

About the Role

We are seeking a Staff Data Scientist to lead advanced data science and R&D efforts for the ID Graph, Socure’s foundational platform powering identity intelligence across our product ecosystem. This Staff-level role operates at platform scale, with responsibility extending beyond a single model or pipeline. You will work at the intersection of graph modeling, machine learning, and product innovation, collaborating closely with Engineering, Product Management, and multiple product teams. The ID Graph is the core intelligence backbone for many downstream products, and your work will directly impact Socure’s ability to deliver trusted, scalable, and explainable identity solutions.

What You'll Do

Entity Resolution & Graph Evaluation
  • Lead the evaluation and continuous improvement of entity resolution and entity linking pipelines.

  • Debug new builds, identify anomalies, and recommend modeling or system-level improvements.

  • Define, implement, and maintain scalable performance and quality metrics, leveraging automation and LLM-based approaches where appropriate.

  • Partner with Engineering to optimize entity linking and ranking systems using Learning-to-Rank and related techniques.

  • Design methods to assess and classify entity confidence and quality across the graph.

Data Quality & Modeling Frameworks
  • Design and implement a comprehensive data quality framework for graph-based identity data.

  • Translate abstract quality concepts (e.g., reliability, stability, consistency) into measurable signals.

  • Use data quality insights to guide modeling decisions, experimentation strategy, and product prioritization.

Signal Discovery & Graph Intelligence
  • Identify and operationalize generalized, high-impact predictive signals derived from graph structure, temporal dynamics, and relational patterns.

  • Develop scalable approaches to link prediction, label propagation, and semi-supervised learning within the ID Graph.

  • Explore and evaluate advanced graph modeling techniques, including graph-based ML, knowledge graph methods, and Graph Neural Networks (GNNs), when appropriate.

  • Focus on durable abstractions rather than one-off features, ensuring solutions are explainable, compliant, and reusable across multiple products.

Cross-Functional Collaboration & Technical Leadership
  • Collaborate closely with Engineering, Product Management, Compliance, and downstream product teams.

  • Act as a technical leader within the Identity organization, influencing modeling standards, experimentation rigor, and best practices.

  • Translate complex technical findings into clear insights and recommendations for both technical and non-technical stakeholders.

  • Support the launch of new product capabilities built on top of the ID Graph.

Leadership Competencies
  • Demonstrate strong ownership, strategic impact, and assertive communication.

  • Mentor peers, foster a culture of growth, and build authentic relationships across teams.

  • Embrace feedback, adapt resiliently to challenges, and pursue continual self-improvement.

What You Bring

Core Technical Skills
  • Strong proficiency in Python and PySpark.

  • Deep experience with:

    • Classification models

    • Learning-to-Rank

    • Anomaly Detection

    • Statistical Modeling

  • Experience building and maintaining production-grade ML systems at scale.

Data & Platform Experience
  • Hands-on experience with Databricks.

  • Familiarity with graph databases and query languages such as Amazon Neptune and openCypher.

  • Experience with graph processing frameworks (e.g., GraphFrames).

Preferred Experience
  • Experience applying LLMs for evaluation, automation, or signal discovery.

  • Familiarity with Knowledge Graphs and Graph Neural Networks (GNNs).

Leadership & Collaboration
  • Proven ability to drive cross-functional projects, mentor peers, and influence technical and business outcomes.

  • Excellent communication skills, with the ability to present technical concepts to both technical and non-technical audiences.

Education & Experience
  • Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field.

  • 5+ years of experience in applied data science, machine learning, or artificial intelligence, with a focus on graph-based modeling and large-scale data systems.

Socure is an equal opportunity employer that values diversity in all its forms within our company. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
If you need an accommodation during any stage of the application or hiring process—including interview or onboarding support—please reach out to your Socure recruiting partner directly.

Top Skills

Databricks
GraphFrames
Amazon Neptune
openCypher
PySpark
Python
The Company
Chennai, Tamil Nadu
386 Employees
Year Founded: 2012

What We Do

Socure is the leading platform for digital identity trust. Its predictive analytics platform applies artificial intelligence and machine learning techniques with trusted online/offline data intelligence from email, phone, address, IP, device, velocity, and the broader internet to verify identities in real time. The company has more than 750 customers across the financial services, gaming, telecom, and e-commerce industries, including three of the top five banks, seven of the top 10 card issuers, three of the top MSBs, the top payroll provider, the top credit bureau, and over 100 of the largest and most successful FinTechs. Marquee customers include Chime, Varo Money, Public, Stash, and DraftKings.

Socure has received numerous industry awards and accolades, including being named to Forbes America’s Best Startup Employers 2021, being awarded Best New Technology Introduced over the Last 12 Months – Data and Data Services at the 2020 American Financial Technology Awards (AFTAs), being ranked number 70 in Deloitte’s Technology Fast 500™, being listed as a Gartner Cool Vendor, being recognized by Forbes as one of the Top 25 Machine Learning Startups to Watch, being named to CB Insights: The FinTech 250, and being awarded Finovate’s Award for Best Use of AI/ML, to name a few.

Why Work With Us

Socure is a critical part of the infrastructure of the digital economy, and what we do is essential to ensuring the safety of anyone doing business on the internet. Because of our technology, digital identity theft will be eradicated and more people will be included in the digital economy than ever before.
