Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for building big, moving fast, and taking technology — and careers — to the next level.
We are looking for a detail-oriented and analytical ML Infrastructure Engineer to join our Observability team. In this role, you will build critical infrastructure that ensures the accuracy, relevance, and overall quality of AI applications running on Snowflake infrastructure. You will collaborate closely with researchers, software engineers, and product managers to develop the leading AI application observability and optimization platform.
Responsibilities:
Develop end-to-end AI observability systems, including monitoring, logging, tracing, and alerting pipelines.
Design and implement scalable and efficient tracing and event processing for large-scale data ingestion.
Optimize evaluation performance through improvements in inference infrastructure and distributed computing techniques.
Implement monitoring and analytics tools that track model quality metrics and provide a comprehensive view of model health.
Collaborate with cross-functional teams to integrate agentic AI apps built at Snowflake into the observability platform.
Stay updated with industry trends and advancements in agentic AI to continuously enhance quality.
Participate in on-call rotations and provide support for production systems.
Qualifications:
Bachelor’s degree in Computer Science, Engineering, or a related field. Master’s degree preferred.
2+ years of proven experience designing and developing production systems for machine learning and AI.
Proficiency in a programming language such as Java, C++, or Python.
Experience developing or using observability infrastructure such as OpenTelemetry for large-scale applications is a plus.
Excellent problem-solving skills and ability to troubleshoot complex issues in a production environment.
Strong communication skills and ability to collaborate effectively in a team environment.
Ability to adapt to a fast-paced and evolving technological landscape.
Experience contributing to and maintaining open-source projects is a plus.
Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee's duty to keep customer information secure and confidential.
Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.
How do you want to make your impact?
For jobs located in the United States, please visit the job posting on the Snowflake Careers Site for salary and benefits information: careers.snowflake.com
What We Do
Snowflake powers the end-to-end data lifecycle – from ingesting and processing data to analyzing and modeling it, to building and sharing data and AI applications – helping engineers, analysts, and leaders innovate faster and achieve more with their data.
We're on a mission to empower every enterprise to achieve its full potential through data and AI.
Why Work With Us
Snowflake is where data does more, and so do you. More innovating, more growing, and more collaborating. Here, you’ll find the sweet spot between building big and moving fast, in technology and your career.