Data Scientist II

Posted Yesterday
23 Locations
In-Office
97K-184K Annually
Mid level
Artificial Intelligence • Consumer Web • Digital Media • Software
The Role
Design, build, and deploy ML and AI systems at scale for content classification, recommendations, search, and metadata enrichment. Work with cross-functional teams to process large datasets, develop models from classical NLP to LLMs, implement production pipelines, and communicate results to stakeholders.
Summary Generated by Built In

Scribd, Inc. is on a mission to advance human understanding. Our four products — Scribd®, Slideshare®, Everand™, and Fable — help billions of people across the globe move beyond access and into insight, application, and expertise.

Culture at Scribd, Inc.

We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

We believe the best work happens when individual flexibility is balanced with meaningful community connection. Scribd Flex empowers employees to choose the workstyle and location that support their best performance, while committing to intentional in-person moments that strengthen collaboration and culture. Occasional in-person attendance is required for all Scribd, Inc. employees, regardless of location.

So what are we looking for in new team members? At Scribd, Inc., we hire for “GRIT.” Traditionally defined as the intersection of passion and perseverance toward long-term goals, GRIT reflects the mindset we expect from every employee. For us, it also serves as a practical framework for how we work: setting and achieving Goals, delivering Results within your role, contributing Innovative ideas and solutions, and strengthening the broader Team through collaboration and attitude.

This posting reflects an approved, open position within the organization.

About the team

The Applied Research team is a group of data scientists and content specialists who are experts in leveraging machine learning, natural language processing and generative AI models to develop solutions which deliver value to our users and business.

We act as a key driver for innovation, whether it’s in product surface experimentation, metadata generation or model development. Along with Product and Engineering partners, we design solutions and collaborate in cross-functional squads to maximize business impact.

Our areas of impact include content enrichment, representation learning, recommendations, search, translation and many others, applied to diverse media across text, image, and audio. We operate at a scale of hundreds of millions of documents, millions of users and billions of user interactions.

Role Overview

We are seeking a Data Scientist II with experience developing and deploying machine learning models. You will help design and implement high impact AI and ML systems. We work in cross-functional teams collaborating with Machine Learning Engineers, Data Engineers and Product. We are seeking a curious and collaborative individual with an eye for simplicity, end-end visibility and impact and that is excited about building models using massive amounts of data, using language models and deploying models.


Responsibilities

  • Focus on a variety of content classification use cases, leveraging everything from traditional NLP to sophisticated LLMs and generative models

  • Investigate methods of solving our most challenging problems at Scribd, at scale

  • Collaborate with other Data Scientists, Machine Learning Engineers and ML Data Engineers on cross-functional projects

  • Leverage any algorithm at your disposal: from classical Scikit-learn and NumPy models to custom Neural Networks in PyTorch to third party LLM APIs

  • Process massive amounts of data with Python, SQL and Spark

  • Align with stakeholders through written and verbal communications methods on the approaches and results of projects, while writing detailed, accurate and concise project documentation

Requirements

  • 3+ years of post qualification experience developing machine learning models, working with systems at scale and deploying to production environments.

  • Proficiency in Python.

  • Hands-on experience building ML pipelines and working with distributed data processing frameworks like Apache Spark, Databricks, or similar.

  • Intermediate level in at least three of these fields: classification algorithms, natural language processing, search, information retrieval, named entity recognition, deep learning, generative models.

  • Intermediate level or greater experience with SQL or PySpark.

  • Bachelors or Masters in relevant quantitative discipline including but not limited to Statistics, Computer Science, Data Science, Artificial Intelligence or another field with a strong quantitative focus.

At Scribd, your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States. In the state of California, the reasonably expected salary range is between $118,000 [minimum salary in our lowest geographic market within California] to $184,000 [maximum salary in our highest geographic market within California].

In the United States, outside of California, the reasonably expected salary range is between $97,000 [minimum salary in our lowest US geographic market outside of California] to $175,000 [maximum salary in our highest US geographic market outside of California].

In Canada, the reasonably expected salary range is between $123,000 CAD[minimum salary in our lowest geographic market] to $164,000 CAD[maximum salary in our highest geographic market].

We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership, and a comprehensive and generous benefits package.

Working at Scribd, Inc.

Are you currently based in a location where Scribd, Inc. can employ you?
Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:


United States:

Atlanta | Austin | Boston | Dallas | Denver | Chicago | Houston | Jacksonville | Los Angeles | Miami | New York City | Phoenix | Portland | Sacramento | Salt Lake City | San Diego | San Francisco | Seattle | Washington D.C.

Canada:

Ottawa | Toronto | Vancouver

Mexico:

Mexico City

Benefits at Scribd, Inc.

  • Scribd Flex (flexible work model)

  • Comprehensive health, dental, and vision coverage

  • Mental health support and disability coverage

  • Generous paid time off, including vacation, sick time, holidays, winter break, volunteer time, and sabbaticals

  • Paid parental leave and family support benefits

  • Retirement matching and employee equity

  • Learning and development programs and professional growth opportunities

  • Wellness and home office stipends

  • Complimentary access to the Scribd, Inc. suite of products

  • Enterprise access to leading AI tools

Get to Know Scribd, Inc.
About Scribd, Inc.
Life at Scribd, Inc.

We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing [email protected] about the need for adjustments at any point in the interview process.

If you apply for a job with Scribd or otherwise engage with us in connection with employment (including as an employee, contractor, or other personnel), the personal information we process in that context is subject to our Employee and Applicant Privacy Policy, which is available here.

Scribd, Inc. is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

Skills Required

  • 3+ years developing machine learning models, working with systems at scale, and deploying to production environments.
  • Proficiency in Python.
  • Hands-on experience building ML pipelines and working with distributed data processing frameworks like Apache Spark or Databricks.
  • Intermediate level in at least three: classification algorithms, natural language processing, search, information retrieval, named entity recognition, deep learning, generative models.
  • Intermediate level or greater experience with SQL or PySpark.
  • Bachelor's or Master's in a quantitative discipline (Statistics, Computer Science, Data Science, AI, or similar).
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
294 Employees

What We Do

Scribd, Inc. is an AI-powered knowledge company on a mission to advance human understanding. It operates a digital ecosystem comprising four primary products—Scribd, Slideshare, Everand, and Fable—which provide global access to a vast library of ebooks, audiobooks, and user-uploaded documents. By leveraging human-created content and intelligent tools, the company helps users move beyond simple access toward insight, application, and professional expertise.

Similar Jobs

Socure Logo Socure

Data Scientist

Artificial Intelligence • Machine Learning • Software • Analytics
Remote or Hybrid
6 Locations
386 Employees
130K-150K Annually

Socure Logo Socure

Data Scientist

Artificial Intelligence • Machine Learning • Software • Analytics
Remote or Hybrid
5 Locations
386 Employees
140K-170K Annually
In-Office or Remote
2 Locations
1987 Employees
83K-127K Annually

Socure Logo Socure

Data Scientist

Artificial Intelligence • Machine Learning • Software • Analytics
Remote or Hybrid
5 Locations
386 Employees
140K-170K Annually

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account