AI21 Labs

Deep Learning Engineer

Posted 8 Days Ago

Senior level

Artificial Intelligence • Software

The Role

As a Deep Learning Engineer, you'll develop and optimize large-scale language models, improve training infrastructure, and evaluate model performance across benchmarks.

Summary Generated by Built In

Description

Our team is looking for a Deep Learning Engineer.

AI21 is one of the few companies to have trained multi-billion-parameter Large Language Models (LLMs), a feat that requires advanced engineering (large-scale distributed training on thousands of cores). Serving these LLMs efficiently requires cutting-edge technology as well. As a deep learning engineer on the team, you will be responsible for maintaining and improving our training infrastructure; developing, scaling, and testing new ideas; and adapting our code to run on, and make the best use of, the newest and most advanced hardware accelerators.

Role and Responsibilities

Develop Large Language Models as part of our applied research projects and in support of AI21 Platform, including designing, implementing and training massive-scale deep language models
Implement, optimize, scale and test new cutting-edge ideas and architectures
Perform large-scale evaluations and comparisons of trained models across a range of benchmarks, and add support for new benchmarks

Requirements

B.Sc. in computer science, software engineering or equivalent
Self-learner with a proven record of removing technical roadblocks
5+ years of experience developing software for production systems and/or internal infrastructure/tools
Prior experience working with cloud computing platforms and container tooling (e.g. AWS, GCP, Docker, Kubernetes)
Skilled at writing production-grade Python code
Hands-on experience in deep learning and machine learning (e.g. TensorFlow, PyTorch)

Any one of the following:

Optimization of deep learning model training (e.g. parallelization, Megatron, DeepSpeed, FSDP)

- or -

Custom kernel experience (C++/CUDA and/or Triton)

- or -

Distributed Systems, in particular distributed deep learning training/serving

About Us

AI21 Labs is pioneering the development of Foundation Models and AI Systems for enterprises, accelerating the adoption of Generative AI in production.

Established in 2017 by AI visionaries Prof. Amnon Shashua, Prof. Yoav Shoham, and Ori Goshen, our mission is to equip businesses with cutting-edge LLMs and AI capabilities. We are backed by leading investors including Pitango, Google, Nvidia, Intel Capital, and Comcast Ventures.

Join us on this exciting journey and advance your career with AI21 Labs!

Top Skills

AWS

C++

CUDA

Docker

GCP

Kubernetes

Python

PyTorch

TensorFlow

Triton

The Company

HQ: Tel Aviv

276 Employees

On-site Workplace

Year Founded: 2017

What We Do

AI21 is pioneering the development of enterprise AI Systems and foundation models. Our mission is to transform cutting-edge deep tech research into enterprise-ready AI systems. We offer privately deployed models with unmatched security, privacy and reliability, along with tailored solutions for every organization. Founded in 2017, AI21 has raised $336 million from leading investors including NVIDIA, Google and Intel.