Mach9

ML Infrastructure Engineer

Reposted 18 Days Ago

San Francisco, CA, USA

In-Office

160K-250K Annually

Mid level

Computer Vision • Machine Learning • Software

The role

At Mach9, ML infrastructure engineers build and maintain the systems that power production AI models for civil engineering and surveying. Our ML pipeline spans 10,000+ miles of labeled survey data, image segmentation networks, and 3D prediction models serving real-time inference to surveyors and engineers in the field.

This role is ideal for mid-career ML infrastructure engineers with experience building for both training and inference.

You'll build training pipelines that train deep transformer models on hundreds of terabytes of 3D point cloud and image data. You'll also architect our inference infrastructure, which serves both heavy offline detection algorithms and responsive real-time inference that integrates directly with our CAD software.

Responsibilities

Design and build a centralized system for versioning training data, generated datasets, and model artifacts, with full lineage tracking from raw source data through to trained model outputs.
Develop and maintain reliable, reproducible ML training and data generation pipelines.
Refactor and harden existing training and data generation scripts into composable, testable, and maintainable components.
Create CI/CD workflows for validating data pipelines and model training runs, including automated correctness checks and regression detection.
Build tooling that enables ML engineers to launch, monitor, and debug training jobs with minimal friction.
Optimize and scale real-time model inference services to meet latency and throughput requirements in production, including profiling, batching strategies, and resource-efficient serving.
Own the deployment path from trained model artifact to production endpoint, ensuring reliable rollouts, rollback, and monitoring.

Requirements

3+ years of work experience in relevant fields.
Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience.
Strong communication skills and the ability to work closely with ML researchers and engineers to understand their workflows and translate them into robust systems.
Experience designing and building data versioning, artifact management, or dataset lineage systems (e.g., DVC, LakeFS, Weights & Biases, or custom solutions).
Hands-on experience with ML pipeline orchestration tools (e.g., Airflow, Prefect, Metaflow, or similar).
Experience with model serving and inference optimization, such as profiling latency, reducing memory footprint, or scaling serving infrastructure to meet real-time constraints.
Ability to read and refactor ML training code. You don't need to design model architectures, but you need to understand what training pipelines are doing well enough to make them reliable.
Proficiency with Python and PyTorch.

Bonus qualifications

Familiarity with AWS infrastructure services.
Experience with containerized ML workflows and GPU-accelerated training environments.
Experience with model optimization techniques (e.g., quantization, TensorRT, ONNX Runtime, distillation).
Knowledge of infrastructure-as-code tools (e.g., AWS CDK, Terraform).
Experience building or operating ML systems that handle large unstructured datasets (imagery, 3D data, sensor data).

The Company

HQ: San Francisco, CA

24 Employees

Year Founded: 2021

What We Do

Mach9 is at the forefront of leveraging advanced machine learning and computer vision techniques to transform raw geospatial data into actionable insights that help civil engineers build and maintain infrastructure globally. Our first product, Mach9 Digital Surveyor, helps surveyors automatically extract features from large-scale imagery and 3D datasets over 30x faster than today's manual, labor-intensive drafting workflows, accelerating the development of cost-effective and sustainable transportation and utility infrastructure. Mach9 helps leading asset owners and engineering and construction organizations around the world solve their toughest engineering design, mission planning, and asset management problems.