Forward Deployed Systems Engineer

San Jose, CA
In-Office
150K-240K Annually
Mid level
Artificial Intelligence • Hardware • Software
The Role
Support deployments of AI accelerators in datacenter environments, troubleshoot issues, validate performance, and guide customers through system design and rollout.

About Etched

Etched is building the world’s first AI inference system purpose-built for transformers, delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.

Job Summary
As a Forward Deployed Systems Engineer (Datacenter), you will work hands-on in large-scale datacenter environments to bring up, validate, and debug Etched accelerator-based systems. You’ll own customer deployment execution end-to-end, solve complex hardware/software/networking issues under real-world constraints, and build the tooling and processes that make deployments scalable and repeatable.

Key Responsibilities

Customer deployment ownership

  • Own technical execution of customer deployments, including readiness assessments, installation coordination, integration testing, and stabilization

  • Serve as the primary technical point of contact during deployment and ramp, ensuring rapid issue resolution and customer confidence

  • Drive deployment planning with customers and internal teams to identify risks and establish success criteria

Technical problem solving & diagnostics

  • Debug and resolve system-level issues across hardware, firmware, drivers, Linux OS, and networking stacks

  • Perform root-cause analysis on field issues and partner with internal engineering teams to reproduce issues and ship fixes

  • Build and maintain diagnostics, monitoring/telemetry, and dashboards to improve visibility into system health

Firmware & system updates

  • Coordinate firmware updates, hotfixes, and configuration changes to improve reliability and unblock customer deployments

  • Manage update rollouts with minimal downtime and clear communication to customers and internal teams

Documentation & knowledge management

  • Create and maintain deployment runbooks, escalation playbooks, known-issues documentation, and field service guides

  • Capture structured learnings from deployments and convert them into repeatable processes

Cross-functional collaboration

  • Partner with firmware/software/hardware engineering teams to drive reliability improvements

  • Collaborate with Deployment, RMA, Sustaining Engineering, and Support teams to ensure smooth handoffs and long-term supportability

Process improvement & scaling

  • Improve deployment workflows, metrics, and best practices to reduce deployment risk, time, and cost

  • Identify patterns across deployments and drive process/tooling improvements that scale across customers

You may be a good fit if you have

  • Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent practical experience)

  • Experience supporting datacenter customers in systems, networking, AI infrastructure, or field engineering roles

  • Deep experience with Linux/Unix installation, configuration, debugging, and performance tuning

  • Strong networking fundamentals including TCP/IP, RDMA, and low-latency Ethernet

  • Hands-on experience with high-speed networking such as 100GbE / 400GbE

  • Strong ability to diagnose performance and integration issues across hardware + OS + networking layers

  • BMC experience, telemetry/monitoring systems, RMA/failure analysis exposure

  • Willingness to travel up to 30% of the time to customer sites for deployments, on-site debugging, and launch support

Benefits

  • Medical, dental, and vision packages with generous premium coverage

  • $500 per month credit for waiving medical benefits

  • Housing subsidy of $2k per month for those living within walking distance of the office

  • Relocation support for those moving to San Jose (Santana Row)

  • Various wellness benefits covering fitness, mental health, and more

  • Daily lunch + dinner in our office

How we’re different

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in San Jose (Santana Row), and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Top Skills

100GbE
400GbE
AI Systems
Ethernet
Linux
Networking
RDMA
TCP/IP

The Company
HQ: Cupertino, CA
53 Employees
Year Founded: 2022

What We Do

By burning the transformer architecture into our chips, we’re creating the world’s most powerful servers for transformer inference.
