At Snowflake, we are powering the era of the agentic enterprise. To usher in this new era, we seek AI-native thinkers across every function who are energized by the opportunity to reinvent how they work. You don’t just use tools; you possess an innate curiosity, treating AI as a high-trust collaborator that is core to how you solve problems and accelerate your impact. We look for low-ego individuals who thrive in dynamic and fast-moving environments and move with an experimental mindset — who rapidly test emerging capabilities to discover simpler, more powerful ways to deliver results. At Snowflake, your role isn't just to execute a function, but to help redefine the future of how work gets done.
We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems.
Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models. This is an exciting opportunity to collaborate with a world-class team, including founding members of DeepSpeed, vLLM, and TensorFlow. Together, we will push the boundaries of deep learning systems and drive cutting-edge innovations in AI.
Responsibilities:
Analyze and optimize GPU kernel performance for training and inference of LLMs.
Develop and implement strategies to enhance the efficiency and scalability of deep learning systems.
Profile and benchmark deep learning systems using tools and techniques to identify bottlenecks.
Design and implement optimizations to reduce latency and improve resource utilization for training and inference.
Stay updated with the latest advancements in GPU kernel optimization, deep learning, and LLM system development.
Contribute to the development of agentic frameworks and applications for LLM-driven workflows, enhancing automation, reasoning, and decision-making capabilities.
Open-source and publish innovations, optimizations, and engineering practices in technical blogs, top-tier conferences and journals.
Requirements:
Bachelor’s degree in Computer Science, Electrical Engineering, or a related field. A Master’s degree or PhD is preferred.
5 years of experience in GPU kernel optimization, deep learning system optimization, or high-performance computing (HPC).
Proficiency in deep learning frameworks such as PyTorch, TensorFlow, JAX.
Strong understanding of GPU architectures and experience with CUDA or similar frameworks.
Experience with frameworks like CUTLASS, Triton, cuDNN, etc.
Experience with profiling tools (e.g., nvprof, Nsight) and performance analysis methodologies.
Solid problem-solving skills and ability to debug complex performance issues.
Excellent communication skills and ability to work effectively in a cross-functional team environment.
Join us in optimizing deep learning systems and pushing the boundaries of AI efficiency. Apply now to be part of our dynamic and pioneering team!
Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.
How do you want to make your impact?
For jobs located in the United States, please visit the job posting on the Snowflake Careers Site for salary and benefits information: careers.snowflake.com
Skills Required
- Bachelor's degree in Computer Science, Electrical Engineering, or related field
- Master's degree or PhD (preferred)
- 5 years experience in GPU kernel optimization, deep learning system optimization, or high-performance computing (HPC)
- Proficiency with deep learning frameworks such as PyTorch, TensorFlow, JAX
- Strong understanding of GPU architectures and experience with CUDA or similar frameworks
- Experience with frameworks like CUTLASS, Triton, cuDNN
- Experience with profiling tools (e.g., nvprof, Nsight) and performance analysis methodologies
- Ability to debug complex performance issues and strong problem-solving skills
- Excellent communication skills and ability to work in cross-functional teams
Snowflake Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Snowflake and has not been reviewed or approved by Snowflake.
-
Equity Value & Accessibility — Equity grants (RSUs) and an ESPP are central to total compensation and are described as highly valuable. Feedback suggests many see equity as a major satisfaction driver with meaningful upside potential.
-
Fair & Transparent Compensation — Pay is considered competitive and accompanied by clear communication on salary, equity, and advancement. Feedback suggests pay practices emphasize fairness and transparency.
-
Parental & Family Support — Paid parental leave, fertility benefits, adoption assistance, and family planning resources are notably comprehensive. Feedback suggests these programs materially support major life events.
Snowflake Insights
What We Do
Snowflake powers the end-to-end data lifecycle – from ingesting and processing data to analyzing and modeling it, to building and sharing data and AI applications – helping engineers, analysts, and leaders innovate faster and achieve more with their data. We're on a mission to empower every enterprise to achieve its full potential through data and AI.
Why Work With Us
Snowflake is where data does more, and so do you. More innovating, more growing, and more collaborating. Here, you’ll find the sweet spot between building big and moving fast, in technology and your career.
Gallery
.jpg)







