What You’ll Own
- Cache and metadata enhancements - design and implement improvements to caching policies, eviction logic, and metadata scalability to increase performance and reliability.
- Data path optimization - refine I/O pipelines for S3/GCS/HDFS/POSIX to reduce latency and improve throughput using concurrency and scheduling techniques.
- Distributed systems reliability - strengthen consistency, replication, and fault-tolerance mechanisms across large-scale clusters.
- Feature development and integration - collaborate with product and solution-engineering teams to deliver features that support AI and analytics workloads.
- Code quality and peer collaboration - participate in design reviews, provide constructive feedback, and ensure robust testing and observability in production systems.
What You’ll Do
- Design, build, and optimize distributed components within Alluxio’s orchestration layer.
- Investigate performance bottlenecks and propose scalable solutions using profiling, tracing, and benchmarking tools.
- Collaborate cross-functionally with fellow engineers, architects, and the open-source community to drive improvements.
- Contribute to releases and stability efforts, ensuring enterprise-grade reliability across global deployments.
What We’re Looking For
- Strong computer-science fundamentals and a passion for large-scale distributed systems.
- Professional experience developing in Java, C++, or Go.
- Practical knowledge of concurrency, replication, distributed coordination, and performance tuning.
- Experience with distributed storage, caching, or data-access layers, or with the ecosystems built around them (e.g., Spark, Presto, Hadoop, Kubernetes).
- Bachelor’s or advanced degree in Computer Science or a related technical field (or equivalent experience).
Why Alluxio?
- Build infrastructure trusted by the world’s largest AI and data-driven companies.
- Join a small, senior engineering team where your designs shape the product’s evolution.
- Work directly with the original creators of open-source Alluxio.
- Be part of a culture of empathy, curiosity, and ownership, where engineers collaborate closely to solve hard problems.
What We Do
Alluxio develops open-source data orchestration software for the cloud, proven in production at global web scale for modern data services. Alluxio moves data closer to big data and machine learning compute frameworks in any cloud, across clusters, regions, clouds, and countries, providing memory-speed data access to files and objects. Intelligent data tiering and data management deliver consistent high performance to customers in financial services, high tech, retail, and telecommunications. Alluxio is in production use today at seven of the top ten internet companies. Venture-backed by Andreessen Horowitz and Seven Seas Partners, Alluxio was founded at UC Berkeley’s AMPLab by the creators of the Tachyon open-source project. For more information, contact [email protected].