AI Software Engineer (Platform Software)

Posted 5 Days Ago
9 Locations
In-Office or Remote
Junior
Artificial Intelligence • Information Technology • Software • Database • Semiconductor • Manufacturing
The Role
Develop and optimize AI models in PyTorch for NPU architecture, analyze existing frameworks, and collaborate with the compiler team to enhance performance.
Summary Generated by Built In

About the Job
  • FuriosaAI is looking for passionate AI Software Engineers to join our Platform Team. You will participate in the research and development of models optimized for our NPU accelerator.

  • Our team builds the production-grade, streamlined AI software that makes up our SDK. This includes the runtime, LLM serving framework, and PyTorch models/extensions.

  • Your work on these critical parts of the SDK will directly enable AI developers to efficiently deploy optimized AI models on FuriosaAI NPUs.

Responsibilities
  • Develop and optimize DNN model implementations in PyTorch for FuriosaAI's Tensor Contraction Processor (TCP) architecture

  • Analyze the features, implementations, CUDA and Triton kernels of existing AI model inference frameworks such as vLLM, TensorRT-LLM, and DeepSpeed-MII

  • Research and implement generative AI models, parallelism strategies, and inference techniques to improve performance and efficiency

  • Collaborate closely with the compiler team to optimize and enable models.

Minimum Qualifications
  • BS degree in Computer Science, Engineering, or a related field, or equivalent industry experience

  • Proficiency in Python programming skill

  • Experience in developing AI models in DNN frameworks (e.g., PyTorch)

  • Solid understanding of machine learning, deep learning, natural language processing (NLP), and/or generative AI models

  • Strong communication skills with the ability to collaborate effectively across cross-functional teams

Preferred Qualifications
  • Hands-on experience with PyTorch 2.0 technologies (e.g., TorchDynamo) or DNN compiler technologies, such as Triton and MLIR

  • Proficiency in C++/CUDA or Rust programming skills

  • Hands-on experience deploying and optimizing large-scale ML models in production

  • Hands-on experience in model training and fine-turning of pre-trained models

  • Experience in LLM inference frameworks: vLLM, TensorRT-LLM, and DeepSpeed-MII

  • Strong background in model quantizations and model evaluations

  • Strong background in machine learning, generative AI, and model evaluation techniques

  • Proven track record of contributing to open-source projects

Contact

Top Skills

C++
Cuda
Mlir
Python
PyTorch
Rust
Triton
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Seoul, Seoul
143 Employees
Year Founded: 2017

What We Do

FuriosaAI designs and develops data center accelerators for the most advanced AI models and applications.

Our mission is to make AI computing sustainable so everyone on Earth has access to powerful AI.

Our Background
Three misfit engineers with each from HW, SW and algorithm fields who had previously worked for AMD, Qualcomm and Samsung got together and founded FuriosaAI in 2017 to build the world’s best AI chips.

The company has raised more than $100 million, with investments from DSC Investment, Korea Development Bank, and Naver, the largest internet provider in Korea. We have partnered on our first two products with a wide range of industry leaders including TSMC, ASUS, SK Hynix, GUC, and Samsung. FuriosaAI now has over 140 employees across Seoul, Silicon Valley, and Europe.

Our Approach
We are building full stack solutions to offer the most optimal combination of programmability, efficiency, and ease of use. We achieve this through a “first principles” approach to engineering: We start with the core problem, which is how to accelerate.

Similar Jobs

Remote
3 Locations
127 Employees
225K-275K Annually

Dropbox Logo Dropbox

Senior Lead, Acquisition Growth Campaigns

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
Remote
Canada
2500 Employees
141K-191K Annually

Affirm Logo Affirm

Product Manager

Big Data • Fintech • Mobile • Payments • Financial Services
Easy Apply
Remote
Canada
2200 Employees
178K-228K Annually

Motive Logo Motive

Marketing Operations Lead

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Easy Apply
Remote
Canada
4000 Employees
137K-205K Annually

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account