Research Scientist - NLP

Posted Yesterday
Be an Early Applicant
Cambridge, MA
Hybrid
Entry level
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Software • Generative AI
We make data discoverable, useful, and valuable in enabling fact-based decisions.
The Role
As a Research Scientist at Kensho, you will develop state-of-the-art NLP models, contribute to collaborative research projects, and share your findings. You'll work with techniques for large data processing and evaluate new directions in machine learning. Your role focuses on innovative problem-solving and iterative collaboration with a team.
Summary Generated by Built In

Kensho is S&P Global's hub for AI innovation and transformation. With expertise in Machine Learning and data discovery, we develop and deploy novel solutions for S&P Global and its customers worldwide. Our solutions help businesses harness the power of data and Artificial Intelligence to innovate and drive progress. Kensho's solutions and research focus on speech recognition, entity linking, document extraction, automated database linking, text classification, natural language processing, and more.
Are you looking to solve hard problems and enjoy working with teammates with diverse perspectives? If so, we would love to help you excel here at Kensho. We are a collaborative group of experienced Research Scientists and Machine Learning Engineers, whose academic backgrounds include doctorate degrees in NLP, theoretical physics, statistics, etc. We take pride in our team-based, tightly-knit startup-like Kenshin community, which fosters continuous learning and a communicative environment.
At Kensho, we hire talented people and give them the freedom, support, and resources needed to accomplish our shared goals. We believe in flexibility-first and give our employees the opportunity to work from where they feel most productive and engaged (must be in the United States). We also value in-person collaboration, so there may be times when travel to one of our Kensho hubs (e.g., Cambridge, MA or NYC) will be required for team meetings or company events.
About the R&D Lab:
Since 2022, we have been building a world-class R&D lab comprised of NLP Research Scientists, and we heavily prioritize publishing in top-tier conferences. Our small team has demonstrated compelling results and is fueling innovation throughout Kensho and S&P Global at large. Specifically, we are continuously developing Large Language Models (LLMs) and are actively working on long-context question-answering (QA), complex reasoning, tokenization, alignment (e.g., factuality), multi-document QA, and more!
Our small team has reserved access to hundreds of fast GPUs (A100s), spanning Cloud and on-prem machines.
Our current projects include:
- Long-context document QA, where the answer is contained within documents that are hundreds of pages in length
- Complex reasoning, including better understanding and improving models' ability to approximate numbers (related to commonsense reasoning).
- Creating rigorous evaluation benchmarks, spanning domain knowledge, quantity extraction, and program synthesis
- Improving existing alignment techniques for domain-specific needs, while also addressing factuality
- Dissecting tokenizers to better understand how each of the sub-components impact intrinsic and extrinsic performance
- Multi-Document QA where the answer requires combining information from dozens of sources.
- Retrieval-augmented generation (RAG) methods
- Creating high-quality data filters for LLM development
Additionally, we maintain strong relationships with academia, including collaborating on several ongoing projects, providing industry grants, sponsoring conferences, and jointly holding faculty positions.
Kensho states that the anticipated base salary range for the position is 150k-225k. In addition, this role is eligible for an annual incentive bonus and equity plans. At Kensho, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case.
Technologies & Tools We Use:

  • ML: PyTorch, Weights & Biases, NetworkX
  • Deployment: Airflow, Docker, EC2, Kubernetes, AWS
  • Datastores: Postgres, Elasticsearch, S3


What You'll Do:

  • Regularly reading late-breaking research papers and helping to identify pertinent directions of work
  • Developing novel, state-of-the-art NLP models that can scale to millions of documents
  • Working closely with other Research Scientists and ML Engineers
  • Writing clean, readable research code in PyTorch (not expected to write production-level code)
  • Contribute to a stellar engineering culture that values excellent design, documentation, testing, and code
  • Share your research results with your colleagues (presentations) and the world (published papers, patents, and blog posts)


What You'll Need:

  • Outstanding people come from all different backgrounds, and we're always interested in meeting talented people! Therefore, we do not require any particular credential or experience. If our work seems exciting to you, and you feel that you could excel in this position, we'd love to hear from you. That said, most of our successful candidates possess the following, which reflects both our technical needs and team culture:
  • Hold a PhD in Computer Science or related field (or a Master's with significant research experience)
  • Have published in a top-tier ML/NLP conference (e.g., ACL, NAACL, EMNLP, NeurIPS, ICML)
  • Are proficient in writing code in PyTorch, Tensorflow, or JAX
  • Have experience with the techniques required to work effectively with large, messy real-world data
  • Prefer to collaborate iteratively on hard problems with your teammates rather than spending stretches of time working alone and presenting your results intermittently
  • Have a love for learning new skills and domains
  • Are excited to share knowledge freely, proactively, and effectively with others who are interested
  • Are a generous teammate who takes work seriously without taking yourself too seriously


At Kensho, we pride ourselves on providing top-of-market benefits, including:

  • Medical, Dental, and Vision insurance
  • 100% company paid premiums
  • Unlimited Paid Time Off
  • 26 weeks of 100% paid Parental Leave (paternity and maternity)
  • 401(k) plan with 6% employer matching
  • Generous company matching on donations to non-profit charities
  • Up to $20,000 tuition assistance toward degree programs, plus up to $4,000/year for ongoing professional education such as industry conferences
  • Plentiful snacks, drinks, and regularly catered lunches
  • Dog-friendly office (CAM office)
  • Bike sharing program memberships
  • Compassion leave and elder care leave
  • Mentoring and additional learning opportunities
  • Opportunity to expand professional network and participate in conferences and events


We are an equal opportunity employer that welcomes future Kenshins with all experiences and perspectives. Kensho is headquartered in Cambridge, MA, with an additional office location in New York City. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.
Job ID: 307697
Posted On: 2024-10-18
Location: Cambridge, Massachusetts, United States

Top Skills

Jax
PyTorch
TensorFlow

What the Team is Saying

Hannah
Lupe
Maury
Melissa
The Company
HQ: Cambridge, MA
100 Employees
Hybrid Workplace
Year Founded: 2013

What We Do

At Kensho, we leverage S&P Global’s world class data to research, develop and implement leading AI and machine learning capabilities that drive fact-based, objective decision making. Data is at the heart of Kensho, and it’s our technology. We build and deploy solutions that make that data accessible, insightful, relevant and transformative.

Why Work With Us

We are a diverse and inclusive group of curious, highly accomplished engineers and business professionals who value collaboration, curiosity, and mentorship at all levels. At Kensho, swinging for the fences is considered a team sport, and every Kenshin’s unique perspective and experiences are valued.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Kensho Technologies Teams

Team
Elevate Your Engineering Skills
About our Teams

Kensho Technologies Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Not Specified
HQCambridge, MA
New York, NY
Learn more

Similar Jobs

Kensho Technologies Logo Kensho Technologies

Senior Research Scientist

Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Software • Generative AI
Hybrid
Cambridge, MA, USA
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account