Software Engineer, Acceleration Kernel Development

Posted 3 Days Ago
Be an Early Applicant
Toronto, ON, CAN
In-Office
100K-500K Annually
Mid level
Hardware • Manufacturing
The Role
Design, implement, and optimize low-level compute kernels for parallel ML workloads. Write high-performance C/C++ code, tune instruction-level latency, memory, and bandwidth, profile and debug a low-level software stack, and collaborate with ML and hardware engineers to integrate optimizations into production ML pipelines.
Summary Generated by Built In

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

As a Software Engineer on the Acceleration Kernel Development team at Tenstorrent, you’ll work at the intersection of software and hardware performance. You’ll be writing low-level code that directly powers high-efficiency machine learning workloads, optimizing every cycle, every memory move, every instruction. If you're motivated by performance, precision, and real impact, this is where your skills will shine.

This role is hybrid, based out of Toronto, ON.

We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.


Who You Are

  • A developer who loves high performance code, parallel algorithms, wrangling bits, optimizing compute, and making hardware fly.
  • Great in C/C++ and able to build fast, efficient code from the ground up.
  • Obsessed with performance and precision, especially in ML workloads.
  • Motivated by complex problems and thrives in collaborative, fast-moving environments.

What We Need

  • Expertise in building and optimizing compute kernels for parallel ML and high-performance workloads.
  • Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth.
  • A collaborative mindset to work closely with ML engineers and integrate optimizations into production.
  • Ownership of debugging, profiling, and maintaining a fast, reliable low-level software stack.

What You Will Learn

  • The art of pushing AI hardware to its limits by shaping how kernels are written and executed.
  • How to integrate kernel work into ML frameworks and real-world training pipelines.
  • Skills in tuning performance on cutting-edge architectures with top-tier hardware engineers.
  • Expertise in keeping code lean, reliable, and scalable even under heavy workloads.

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.  Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).   These requirements apply to persons located in the U.S. and all countries outside the U.S.  As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency.  If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

Skills Required

  • Proficiency in C/C++ and building fast, efficient low-level code
  • Expertise building and optimizing compute kernels for parallel ML and high-performance workloads
  • Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth
  • Experience debugging, profiling, and maintaining a fast, reliable low-level software stack
  • Collaborative mindset to work closely with ML engineers and hardware engineers
  • Obsessive focus on performance and precision in ML workloads
  • Hybrid work in Toronto, ON (based out of Toronto)
  • Eligibility to access U.S. export-controlled technology (citizenship/permanent residency or ability to obtain license)
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Toronto, ON
389 Employees
Year Founded: 2016

What We Do

Tenstorrent is a next-generation computing company that builds computers for AI. Headquartered in Toronto, Canada, with U.S. offices in Austin, Texas, and Silicon Valley, and global offices in Belgrade and Bangalore, Tenstorrent brings together experts in the field of computer architecture, ASIC design, advanced systems, and neural network compilers. Join us: www.tenstorrent.com/careers

Similar Jobs

Capco Logo Capco

Back-End - .NET, API, AI Ready

Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Hybrid
Toronto, ON, CAN
6000 Employees
118K-152K Annually

Pfizer Logo Pfizer

Internal Medicine Health & Science System Specialist - Oakland - Stockton - Fresno, CA

Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Biotech • Pharmaceutical
In-Office or Remote
3 Locations
121990 Employees
109K-251K Annually

Tapestry - Coach and Kate Spade Logo Tapestry - Coach and Kate Spade

Supervisor I

eCommerce • Fashion • Retail • Sales • Wearables • Design
Hybrid
Ottawa, ON, CAN
16000 Employees
21-26 Hourly

DraftKings Logo DraftKings

Senior Machine Learning Engineer

Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Remote or Hybrid
Canada
6400 Employees

Similar Companies Hiring

Fortune Brands Innovations Thumbnail
Manufacturing
Deerfield, IL
2450 Employees
Fairly Even Thumbnail
Hardware • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Amalgamated Sugar Thumbnail
Food • Greentech • Agriculture • Industrial • Manufacturing
Boise, Idaho
768 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account