Cloudflare's Infrastructure Team builds and runs the systems and software that support our solutions that handle trillions of requests per month. We ensure that all of the new and existing features and functionality that Cloudflare offers can be managed at scale and meet the needs of our massively growing, global customer base.
What you'll do
As a Software Engineer: Resiliency, you will be part of a Resiliency Organization responsible for developing and maintaining the systems that manage Cloudflare's infrastructure at scale. We are looking for engineers to join the Maintenance Optimizer team and shape the future of infrastructure reliability. We're building a cutting-edge Maintenance Coordination System in Typescript & Go that will be pivotal in guaranteeing our service level capacity, dynamically allocating hardware resources, and supporting our critical capacity SLOs. This is a unique opportunity to work on complex, globally distributed systems which underpin core infrastructure and all Cloudflare products. We ensure the seamless operation of our global network. You will play a key role in the development of the infrastructure that powers Cloudflare's scale.
You will collaborate with the team to understand business needs and develop technical solutions. You will thrive in a fast-paced iterative engineering environment and work closely with internal customers to understand their requirements.
Technologies we use:
- Cloudflare Workers, Workers KV, R2, and Durable Objects
- Kubernetes
- Go, Typescript, Python
- For service monitoring we use Prometheus, Grafana and Sentry
Because you'll be solving problems of massive scale and significance, and shaping the future of the Internet, you are a growth-oriented individual who enjoys being outside of your comfort zone in a fast-paced environment.
Examples of desirable skills, knowledge and experience
As an ideal candidate for this position, you are curious, hard-working, and passionate.
A rough list of the skills we would love to see you bring:
- A degree in Computer Science, Engineering, Mathematics, Statistics or related field; OR have relevant background/experience to the field.
- Programming experience in Go, or similar languages
- Experience in designing and implementing secure and highly-available distributed systems
- Experience (and love) for debugging to ensure the system works in all cases
- Experience with a continuous integration workflow and using source control (we use git)
- Experience with continuous delivery and deployment of a k8s hosted application
- Understanding of security issues and responsibilities
- Experience with monitoring, alerting and debugging high volume production systems
- Fluent in analyses of data sets such as logs
- Strong English language oral and written communications skills
- Designing and building APIs
- Experience with the Cloudflare development stack is a plus
Examples of desirable skills, knowledge and experience
- At least 4 years of hands-on software development experience on meaningfully complex systems.
- Experience building both backend systems and frontend widgets.
- Ability to contribute to planning, development, and execution to meet commitments and deliver with predictability.
- Experience implementing tools, processes, internal instrumentation, and methodologies.
- Comfortable working on projects with tight deadlines and short release cycles.
- Strong verbal and written English language skills.
- Experience with DCIM, CMDB, IPAM, and other Data Center and Asset Lifecycle Management tools is a plus.
- Experience with data ingestion and analysis - pulling metrics from hundreds of edge data centers.
- Experience with graph theory - building a health graph on one of the world's largest physical networks.
Compensation
Compensation may be adjusted depending on work location.
- For Washington D.C. based hires: Estimated annual salary of $140,000 - 172,000.
Equity
This role is eligible to participate in Cloudflare's equity plan.
Benefits
Cloudflare offers a complete package of benefits and programs to support you and your family. Our benefits programs can help you pay health care expenses, support caregiving, build capital for the future and make life a little easier and fun! The below is a description of our benefits for employees in the United States, and benefits may vary for employees based outside the U.S.
Health & Welfare Benefits
- Medical/Rx Insurance
- Dental Insurance
- Vision Insurance
- Flexible Spending Accounts
- Commuter Spending Accounts
- Fertility & Family Forming Benefits
- On-demand mental health support and Employee Assistance Program
- Global Travel Medical Insurance
Financial Benefits
- Short and Long Term Disability Insurance
- Life & Accident Insurance
- 401(k) Retirement Savings Plan
- Employee Stock Participation Plan
Time Off
- Flexible paid time off covering vacation and sick leave
- Leave programs, including parental, pregnancy health, medical, and bereavement leave
Top Skills
What We Do
Cloudflare, Inc. (NYSE: NET) is the leading connectivity cloud company on a mission to help build a better Internet. It empowers organizations to make their employees, applications and networks faster and more secure everywhere, while reducing complexity and cost. Cloudflare’s connectivity cloud delivers the most full-featured, unified platform of cloud-native products and developer tools, so any organization can gain the control they need to work, develop, and accelerate their business.
Powered by one of the world’s largest and most interconnected networks, Cloudflare blocks billions of threats online for its customers every day. It is trusted by millions of organizations – from the largest brands to entrepreneurs and small businesses to nonprofits, humanitarian groups, and governments across the globe.
Why Work With Us
Cloudflare employees come from all walks of life. We are mission-driven, and our team is energized by a collaborative, creative environment that celebrates our differences and fosters new ways to grow together.
Gallery
Cloudflare Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.
We are committed to developing a global team that is distributed with a flexible working approach. Doing this equitably and inclusively is essential to our success. Visit our careers site for more on 'How & Where We Work.'