We are seeking a highly skilled Ceph Cluster Development & Operations Engineer with strong expertise in C++ systems programming to design, extend, and maintain enterprise-scale Ceph distributed storage clusters. The role involves deep development in Ceph core subsystems (RADOS, OSD, RGW, MDS), performance optimization, and operational excellence across multi-site, multi-zone architectures.
You will work closely with system architects, SREs, and cloud infrastructure teams to ensure the reliability, scalability, and security of mission-critical storage systems deployed across multiple data centers and Kubernetes environments.
Key Responsibilities
- Design, build, and operate large-scale Ceph clusters including RADOS, RGW, RBD
- Contribute to or extend Ceph core components written in C++ (e.g., OSD, RGW, librados, BlueStore, MGR modules).
- Profile and optimize performance across network, disk I/O, and replication layers (PG placement, CRUSH rules, BlueStore tuning).
- Develop automation and tooling for cluster lifecycle management (deployment, upgrades, scaling, failover, and recovery).
- Integrate Ceph with Kubernetes (via Rook-Ceph, CSI drivers) and CI/CD pipelines for continuous delivery.
- Implement and validate multi-site replication and disaster recovery architectures for high availability.
- Develop and maintain secure storage solutions using dm-crypt, KMS integration, and CephX authentication.
- Build observability pipelines using Prometheus, Grafana, and custom exporters for metrics and health analytics.
- Write and maintain SOPs, automation scripts, and system documentation to support production-grade operations.
- Collaborate with upstream Ceph community or maintain in-house forks for feature development and bug fixes.
Qualifications
Required Skills
- Strong proficiency in C++ (C++11 or later), with experience in large-scale distributed systems or kernel-adjacent development.
- Deep understanding of Ceph architecture and its core components: MON, OSD, MGR, RGW, MDS, and CRUSH maps.
- Proficient in Linux systems programming, debugging (gdb, perf, valgrind), and performance profiling.
- Experience with Python or Go for tooling and automation.
- Strong foundation in data replication, erasure coding, and consistency models in distributed storage.
- Hands-on experience with Kubernetes, Rook-Ceph, Helm, Ansible, and related DevOps tools.
- Familiarity with TCP/IP, HTTP/S3 APIs, block storage (RBD/iSCSI), and object storage semantics.
- Ability to conduct root-cause analysis and lead performance investigations under production environments.
Preferred Skills
- Contributions to the Ceph open-source project or prior experience modifying Ceph source code.
- Experience with multi-site replication, object versioning, compliance retention, or legal hold features.
- Background in distributed storage systems, file systems, or cloud storage platforms.
- Familiarity with containerized environments, network virtualization, and cloud-native observability stacks.
- Excellent technical documentation and communication skills in English.
The US base salary range for this full-time position is $179,000-$219,000. Fortinet offers employees a variety of benefits, including medical, dental, vision, life and disability insurance, 401(k), 11 paid holidays, vacation time, and sick time, as well as a comprehensive leave program.
Wage ranges are based on various factors, including the labour market, job type, and job level. Exact salary offers will be determined by factors such as the candidate's subject knowledge, skill level, qualifications, experience, and geographic location.
All roles are eligible to participate in the Fortinet equity program. Bonus eligibility is reviewed at the time of hire and annually at the Company’s discretion.
Why Join Us:
We encourage candidates from all backgrounds and identities to apply. We offer a supportive work environment and a competitive Total Rewards package to support you with your overall health and financial well-being.
Embark on a challenging, enjoyable, and rewarding career journey with Fortinet. Join us in bringing solutions that make a meaningful and lasting impact to our 660,000+ customers around the globe.
About UsSkills Required
- Strong proficiency in C++ (C++11 or later) with systems-level programming experience
- Deep understanding of Ceph core components (MON, OSD, MGR, RGW, MDS) and CRUSH maps
- Experience with RADOS, RBD, librados, BlueStore and Ceph internals
- Linux systems programming and debugging using gdb, perf, valgrind
- Experience with Python or Go for tooling and automation
- Hands-on experience with Kubernetes, Rook-Ceph, CSI drivers, Helm, and Ansible
- Experience building observability pipelines using Prometheus and Grafana (custom exporters)
- Knowledge of data replication, erasure coding, consistency models, and performance tuning (PG placement, CRUSH, BlueStore tuning)
- Experience with storage security: dm-crypt, KMS integration, and CephX authentication
- Familiarity with TCP/IP, HTTP/S3 APIs, block (RBD/iSCSI) and object storage semantics
- Ability to conduct root-cause analysis and lead performance investigations in production
- Contributions to the Ceph open-source project or prior experience modifying Ceph source code
- Experience with multi-site replication, object versioning, compliance retention, or legal hold features
- Background in distributed storage systems, file systems, or cloud storage platforms
- Familiarity with containerized environments, network virtualization, and cloud-native observability stacks
- Excellent technical documentation and communication skills in English
Fortinet Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Fortinet and has not been reviewed or approved by Fortinet.
-
Affordable Benefits — Medical coverage includes a no-premium HDHP option with HSA funding alongside PPO/HMO choices, reducing employee costs at enrollment. Eligibility starts on the date of hire and includes company-paid life and disability coverage.
-
Wellbeing & Lifestyle Benefits — Mental-health access through Modern Health and an EAP offers therapy, coaching, and counseling resources. Lifestyle extras such as legal assistance, commuter benefits, pet-insurance discounts, and subsidized meals and snacks enhance everyday support.
-
Strong & Reliable Incentives — Compensation packages include annual bonuses, sales commissions, and stock awards. Earnings potential can be strong in sales when quotas are realistic and attainment is high.
Fortinet Insights
What We Do
Fortinet develops and sells cybersecurity solutions.








