Network and Systems Engineer

Posted 9 Days Ago
Hiring Remotely in San Francisco, CA
Remote
120K-160K Annually
Senior level
Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
We build infrastructure for machine learning
The Role
The Network and Systems Engineer will design, build, and support the AI Data Center network, compute, and storage environment. Responsibilities include troubleshooting, participating in on-call rotations, and collaborating with various teams to streamline operations and ensure effective incident response.
Summary Generated by Built In

Voltage Park is on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities, to seed-stage startups and nonprofits. We believe that providing seamless access to compute with pricing and inventory transparency is the future of access to GPUs. We are the only cloud provider offering a platform that shows all GPUs available to be rented at any point in time with transparent, market-based pricing, in addition to long-term reserve contracts for our customers. 

Voltage Park is seeking an experienced Network and Systems Engineer to join our team. The ideal candidate will be responsible for building and supporting our AI Data Center network/compute/storage environment. To succeed in this role, you will need to be comfortable owning deployment and troubleshooting tasks covering both network and systems hardware and software.

This is a fully remote role, but you must be located in Poland. You will participate in an on-call rotation to support our 24/7 “follow-the-sun” Operations model.

What You’ll Do:

  • At the direction of the Manager of Network Engineering, assist in the design, build, rollout, and support of new and existing network components and software.

  • Work closely with the Manager of SRE to assist in the build, rollout, and support of new and existing compute/storage components and software.

  • Collaborate with colleagues in Network Engineering, Site Reliability Engineering, and Customer Support in a flat organization.

  • Be on-call for urgent system incident response.

Qualifications:

  • Network Vendor certifications, such as CCNP, CCIE, JNCIP, JNCIE, or equivalent.

  • Understanding of contemporary networking technologies such as IP Clos, VXLAN, EVPN, and multi-tiered data center architectures.

  • Experience using automation tools and platforms, including Nautobot/NetBox or other database-driven Sources of Truth.

  • Understanding of and experience with network automation frameworks, specifically Ansible, Puppet, Terraform, or others.

  • Experience with Python, Rust, Go, or another development language, as well as Vendor-supplied automation environments such as Arista CloudVision or Juniper Apstra.

  • Understanding of full-stack architectures inclusive of Customer-driven workflows, Infrastructure as Code (IaC) and CI/CD deployment models, Cloud Service Provider (CSP) Network Orchestration & Automation, Sources of Truth, and Network Observability and Telemetry, as well as how open source can lead or complement a given architecture.

  • Ability to work closely with SRE and Development teams to propose and develop code for integrated Operations environments, with a key focus on composable and reusable code.

  • Experience with cloud networking technologies including VPCs, NFV, Direct Connect, and Cloud Connect.

  • You enjoy working with a small group of friendly, highly motivated, high-execution colleagues

  • You’re comfortable with a high degree of autonomy, can independently prioritize your work and understand how it maps to the overall needs and goals of the company

  • You’re knowledgeable in your domain but also enjoy wearing multiple hats and venturing outside of your comfort zone when the need arises

  • You must write well and understand the importance of good and complete documentation.

  • Fluent English (C1) 

What We Offer:

  • 100% funded medical care

  • Sports Package 

  • Equity Package 

  • Possibility to travel to the USA

  • 100% Remote work 

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter. 

Compensation Range: $120K - $160K

Top Skills

Go
Python
Rust

The Company
HQ: San Francisco, CA
51 Employees
Remote Workplace
Year Founded: 2023

What We Do

The market for cutting-edge ML compute is broken. Startups, researchers and even big AI labs are scrambling to buy or rent access to the latest chips for ML training. But demand far outstrips supply, and what’s available is only accessible to the well-resourced, placing an artificial damper on innovation.

To solve this challenge, we've launched Voltage Park, and we’re on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities, to seed-stage startups and nonprofits.

With around 24,000 NVIDIA H100 GPUs, the Voltage Park cloud is one of the most powerful collections of cutting-edge ML compute in the world. Our clusters consist of 80GB H100 SXM5 GPUs fully interconnected with 3.2T InfiniBand.

Why Work With Us

You’ll play a pivotal role as a member of the founding team that will change the face of machine learning infrastructure. As an early hire, you’ll have outsize influence in defining the company’s culture and ensuring mission success.

Voltage Park Offices

Remote Workspace

Employees work remotely.

Voltage Park is a 100% remote company.

Typical time on-site: None
HQ: San Francisco, CA
