Key Duties
- Design, develop, maintain and improve our multi-target runtime
- Use the latest techniques in parallelization and partitioning to automate generation and exploit highly optimized kernels
- Rapid prototyping and data driven exploration of new ideas
- Benchmark and analyze the outputs produced by our optimizing compiler on target hardware
- Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in architecture
- Build tools to collect and analyze performance bottlenecks
Essential Skills and Experience
- A deep understanding of asynchronous, concurrent programming.
- 4+ years of experience with C/C++ (C++14 or newer).
- An understanding of HW architecture (vector vs scalar registers and instructions, memory hierarchies).
- Knowledge of operating system kernel development or hypervisor development.
Preferred Skills and Experience
- Experience developing or maintaining libraries like CUDA or ROCm.
- Experience with GPU programming.
- Experience with high performance computing (HPC).
- Masters or PhD degree in computer science, or equivalent practical experience.
- Knowledge of DL frameworks such as PyTorch, JAX or Triton.
- Experience with programming large compute clusters.
Top Skills
What We Do
At Lemurian Labs our focus is on unleashing the capabilities of AI for the benefit of humanity. To fulfill this purpose we are developing a full stack solution consisting of software and hardware that is capable of orders of magnitude better performance and efficiency than legacy solutions, while being designed for scalability. There are massive shifts underway moving us from Software 1.0 to Software 2.0 to Software 3.0 and onwards, but to realize its true benefits we need fundamentally new hardware and systems that can keep up with the changing compute demands and simultaneously bringing down costs. We are developing software and hardware designed from first principles to deliver unprecedented realizable performance/watt and enable the next generation of AI workloads. Our diverse team of technologists have decades of experience at the frontiers of high performance computing, digital arithmetic, cryptography, artificial intelligence, robotics, and networking. There is a lot of talk about what the technology of tomorrow will look like and there are a number of companies developing it. At Lemurian, we believe tomorrow is so yesterday. We are developing the technology for the day after tomorrow. We are Lemurian Labs. Welcome to the future of artificial intelligence and computing.







