Distributed Systems Engineer

Job Posted 13 Hours Ago Posted 13 Hours Ago
Be an Early Applicant
2 Locations
Remote
100K Annually
Mid level
Artificial Intelligence • Software
The Role
As a Distributed Systems Engineer, you will develop data and coordination systems for long-context inference and training, focusing on high-performance storage, automation of fault detection, and troubleshooting across various environments.
Summary Generated by Built In

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As a distributed systems engineer, you will build the data and coordination systems that enable ultra-long context inference and training on Magic’s GPU clusters. 

What you might work on: 

  • High-performance storage and caching systems to support long-context inference and training

  • Hacking on the internals of deep learning frameworks in the distributed setting

  • Automating fault detection and recovery systems to enable highly available training

  • Troubleshooting complex issues across GPUs, network, storage, OS, and cloud environments.

What we’re looking for: 

  • Deep knowledge of distributed systems design and public cloud platforms

  • Experience designing and operating highly available, high-throughput data systems

  • Experience with the internals of distributed DBMS, batch and stream processing systems, and/or distributed file systems

  • Exceptional problem-solving skills up and down the stack

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

Our culture:

  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building 

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):

  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

Top Skills

Batch Processing Systems
Cloud Platforms
Dbms
Deep Learning Frameworks
Distributed File Systems
Distributed Systems
High-Performance Storage
Stream Processing Systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
47 Employees
On-site Workplace
Year Founded: 2022

What We Do

Magic is working on frontier-scale code models to build a coworker, not just a copilot. Come join us: http://magic.dev

Similar Jobs

Sprout Social Logo Sprout Social

Staff Software Engineer - Distributed Systems

Marketing Tech • Social Media • Software • Analytics • Business Intelligence
Easy Apply
Remote
Hybrid
US
1420 Employees

Trumid Logo Trumid

Software Engineer (Distributed Systems)

Fintech • Information Technology • Payments • Software • Financial Services
Easy Apply
Remote
USA
153 Employees

Trumid Logo Trumid

Staff Software Engineer (Distributed Systems)

Fintech • Information Technology • Payments • Software • Financial Services
Easy Apply
Remote
USA
153 Employees

Elastic Logo Elastic

Elasticsearch - Java Engineer II - Distributed Systems

Cloud • Security • Software • Generative AI
Remote
United States
3222 Employees
133K-211K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account