Responsibilities:
- Define product roadmaps for AI‑inference workflows, focusing on latency, throughput, and GPU utilization improvements.
- Engage deeply with technical users to understand bottlenecks in model serving, caching, and scaling and translate them into product specifications.
- Partner with engineering to design and deliver inference‑oriented features such as GPU scheduling, sharding, and streaming data access.
- Work with customers to validate features and measure their impact, incorporating feedback into iteration cycles.
- Stay current with AI infrastructure trends (hybrid clouds, edge inference, multi‑model serving) and integrate them into product planning.
Qualifications:
- 4-8 years of experience in product management, AI infrastructure, or ML engineering roles, with at least 2 years focused on AI/ML workloads.
- Deep understanding of AI/ML workflows, including model deployment, inference optimization, and data‑access patterns.
- Proven track record of delivering features with measurable improvements in latency, throughput, or GPU utilization, or equivalent experience.
- Technical proficiency in distributed systems and cloud platforms (Kubernetes, AWS/GCP/Azure) and familiarity with frameworks like PyTorch, TensorFlow, Triton Inference Server or similar.
- Excellent communication and cross‑functional leadership skills, enabling clear translation of complex technical concepts to varied audiences.
Why Join Alluxio?
- Be part of a world-class team dedicated to solving some of the toughest challenges in big data.
- Work in a dynamic and innovative environment with opportunities for professional growth and development.
- Enjoy a collaborative culture that values empathy, enthusiasm, and creativity.
Top Skills
What We Do
Proven at global web scale in production for modern data services, Alluxio is the developer of open source data orchestration software for the cloud. Alluxio moves data closer to big data and machine learning compute frameworks in any cloud across clusters, regions, clouds and countries, providing memory-speed data access to files and objects. Intelligent data tiering and data management deliver consistent high performance to customers in financial services, high tech, retail and telecommunications. Alluxio is in production use today at seven out of the top ten internet companies. Venture-backed by Andreessen Horowitz and Seven Seas Partners, Alluxio was founded at UC Berkeley’s AMPLab by the creators of the Tachyon open source project. For more information, contact [email protected].