Senior Network Engineer

Posted 10 Days Ago
Hiring Remotely in San Francisco, CA
Remote
150K-200K Annually
7+ Years Experience
Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
We build infrastructure for machine learning
The Role
Being responsible for the L3 and below network infrastructure, managing the build out and bring up of large-scale compute infrastructure, implementing best practices for security and reliability, proactively detecting network issues, and triaging network incidents and outages.
Summary Generated by Built In

Voltage Park is building a cloud infrastructure business from the ground up. As part of this effort, we’re looking for a Senior Network Engineer to own everything L3 and below. You’ll play a pivotal role as a member of the founding team, responsible for bringing a substantial amount of infrastructure online across multiple data centers. As an early hire, you’ll also have outsize influence in defining the company’s culture and ensuring mission success.

This is a fully remote role, but you must be located in the United States. We are not able to provide sponsorship for this position. 

What you’ll do

  • Manage the build out and bring up of large scale compute infrastructure deployed at multiple sites across the United States.

  • Implement best practices for security, scalability, and reliability with a strong emphasis on software-based automation

  • Proactively detect network issues before they become a big problem

  • Triage network incidents and outages, identify and address technical debt

Requirements

  • You have comprehensive experience with Internet-scale routing tables, routing policies, peering policies, and architectures

  • Experience with technologies and protocols including BGP, EVPN, SDN, SD-WAN

  • You have hands-on experience working with SONiC, Infiniband

  • Experience with cloud networking technologies including VPC’s, NFV, Direct Connect, Cloud Connect

  • Familiarity with network topologies including Clos, Rail optimized, and fat tree

  • You know how to code in a scripting language like Python and have experience using automation tools like Ansible

  • Bonus: Linux or BSD system administration experience

  • You enjoy working with a small group of friendly, highly motivated, high-execution colleagues

  • You’re comfortable with a high degree of autonomy, can independently prioritize your work and understand how it maps to the overall needs and goals of the company

  • You’re knowledgeable in your domain but also enjoy wearing multiple hats and venturing outside of your comfort zone when the need arises

  • You value the ability to write well and understand the importance of good documentation

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter. 

Compensation Range: $150K - $200K

Top Skills

Python

What the Team is Saying

Melissa Du
The Company
HQ: Berkeley, CA
45 Employees
Remote Workplace
Year Founded: 2023

What We Do

The market for cutting-edge ML compute is broken. Startups, researchers and even big AI labs are scrambling to buy or rent access to the latest chips for ML training. But demand far outstrips supply, and what’s available is only accessible to the well-resourced, placing an artificial damper on innovation.

To solve this challenge, we've launched Voltage Park, and we’re on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities, to seed-stage startups and nonprofits.

With around 24,000 NVIDIA H100 GPUs, the Voltage Park cloud is one of the most powerful collections of cutting-edge ML compute in the world. Our clusters consist of 80GB H100 SXM5 GPUs fully interconnected with 3.2T InfiniBand. We currently offer bare-metal access for large-scale users that need peak performance. We will add support for short-term leases and hourly billing soon as we spin up our infrastructure along with support for familiar tools like Slurm, Kubernetes, and Mosaic for easy integration into existing training frameworks.

Why Work With Us

You’ll play a pivotal role as a member of the founding team that will change the face of machine learning infrastructure. As an early hire, you’ll have outsize influence in defining the company’s culture and ensuring mission success.

Voltage Park Offices

Remote Workspace

Employees work remotely.

Voltage Park is a 100% remote company.

Typical time on-site: None
HQBerkeley, CA

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account