Software Engineer — Distributed Systems

Posted 4 Days Ago
2 Locations
In-Office or Remote
Mid level
Artificial Intelligence • Information Technology
The Role
As a Software Engineer for Distributed Systems, you will design and validate distribution libraries and protocols for scalable multi-device clusters, ensuring performance and correctness in AI workloads.
Meet Arago and the Aragonians

Arago is an AI and computer hardware company whose mission is to drive the course of history forward. We do so by accelerating breakthroughs at the intersection of AI and semiconductors.

Founded in 2024 by AI researchers and physicists with deep expertise in photonics, electronics, software, mathematics, and machine learning, Arago brings together a lean team of engineers and scientists from the world’s top companies and research labs.

Our team spans nine nationalities and operates from hubs in France, North America, and Israel. We believe in great science and fast achievements. Our work is guided by these core principles:

  1. Do great things: we deliver work we’re proud to sign our name to.

  2. High velocity: speed matters. We move quickly, one step at a time.

  3. One unit: we’re all in this together, with relationships grounded in trust, respect, and camaraderie.

Arago is backed by executives from Apple, Arm, Nvidia, Microsoft, and Hugging Face, as well as prominent US and European deeptech venture firms and exited founders.

What you’ll do

As a Distributed Systems Software Engineer, you will be the architect behind scaling Arago’s accelerator from a single device to massive multi-device and multi-node clusters. Your work focuses on creating the distribution libraries, programming model extensions, and observability tooling that make clustered accelerators usable, debuggable, and correct at scale. You will not only design these systems but also validate them through hands-on prototype deployments, ensuring our hardware powers the next generation of distributed AI workloads.

Required Skills and Experience
  • Master’s or PhD degree in Computer Science, Distributed Systems, or a related field.

  • Strong understanding of OS internals relevant to distributed environments (networking stacks, I/O, IPC).

  • Solid knowledge of memory safety and concurrency control in complex, long-running systems.

  • Practical experience with virtualization frameworks and isolation in distributed deployments.

  • Proven understanding of resource-sharing protocols, coordination primitives, and distributed memory models.

  • Familiarity with parallel programming models (e.g., MPI, NCCL, or similar) for distributed software.

  • Ability to model and emulate distributed system behavior to analyze performance and failure modes.

  • Language: English at a proficient level. French is a plus.

Responsibilities
  • Develop high-performance distribution libraries enabling seamless multi-device and multi-node accelerator use.

  • Architect and implement distribution features within the programming model to support coordination and synchronization.

  • Implement robust protocols for resource sharing and memory unification across heterogeneous devices and nodes.

  • Drive the prototype deployment and validation of clustered accelerator systems.

  • Build specialized tooling for distributed systems, including cross-node tracing and global debugging.

  • Integrate virtualization and hardware-level isolation mechanisms into distributed deployments.

  • Model and emulate distributed accelerator behavior to ensure architectural correctness and peak performance.

  • Debug critical issues spanning the host, device, runtime, and network layers.

Pay and benefits
  • Competitive cash compensation, with final package based on location, experience, and the pay of team members in similar positions.

  • Meaningful stock option plan offered at the earliest stage of the company (included in the majority of full-time offers).

  • Relocation bonus and coverage of moving expenses for candidates relocating to within 20 minutes of the company’s location.

  • Healthcare coverage (including family-friendly options), pension contributions, professional development support, and 25 days of PTO, in addition to public holidays.

  • Ownership of a key technical domain, with significant vertical and/or horizontal growth opportunities, based on performance and individual drive.

  • Reimbursement of 50% of the public transport subscription fee.

  • A fast-paced, multicultural (10 nationalities), engineering-led environment.

Our Hiring Process: Your Journey to Arago
  • Screening Call: Get to know you beyond your CV.

  • Technical Meeting: Deep dive into your past projects and technical achievements.

  • CEO Interview: Assess the fit with the team’s culture and long-term vision.

  • Reference Calls: Mandatory calls with your former managers to validate strengths, weaknesses, and work style.

  • Technical Assessment: Take-home technical assignment tailored to the role you’re applying for.

  • Final Interview: Discuss your technical assignment and address any remaining questions with team members.

Top Skills

Distributed Systems
MPI
NCCL
Parallel Programming Models
Virtualization Frameworks

The Company
HQ: Paris
20 Employees

What We Do

Arago is hiring in Paris, California, and Israel.
If you're excited about shaping the future of computing and AI, we'd love to hear from you.
