ML Infrastructure Engineer

Reposted Yesterday
San Francisco, CA, USA
In-Office
160K-200K Annually
Mid level
Computer Vision • Machine Learning • Software
The Role
The ML Infrastructure Engineer at Mach9 will design and maintain CI/CD pipelines for ML workflows, optimize real-time inference services, and build data management systems, collaborating closely with ML researchers.
Summary Generated by Built In
The role

At Mach9, ML infrastructure engineers build and maintain the systems that power production AI models for civil engineering and surveying. Our ML pipeline spans 10,000+ miles of labeled survey data, image segmentation networks, and 3D prediction models serving real-time inference to surveyors and engineers in the field.

This role is ideal for mid-career ML infrastructure engineers with experience building for both training and inference.

You'll build training pipelines that handle deep transformer models on hundreds of terabytes of 3D point cloud and image data. You'll also architect our inference infrastructure, delivering both heavy offline detection algorithms and real-time responsive inference that integrates directly with our CAD software.

Responsibilities
  • Design and build a centralized system for versioning training data, generated datasets, and model artifacts, with full lineage tracking from raw source data through to trained model outputs.

  • Develop and maintain reliable, reproducible ML training and data generation pipelines.

  • Refactor and harden existing training and data generation scripts into composable, testable, and maintainable components.

  • Create CI/CD workflows for validating data pipelines and model training runs, including automated correctness checks and regression detection.

  • Build tooling that enables ML engineers to launch, monitor, and debug training jobs with minimal friction.

  • Optimize and scale real-time model inference services to meet latency and throughput requirements in production, including profiling, batching strategies, and resource-efficient serving.

  • Own the deployment path from trained model artifact to production endpoint, ensuring reliable rollouts, rollback, and monitoring.

Requirements
  • 3+ years of work experience in relevant fields.

  • Bachelor's or Master's degree in Computer Science, Engineering, or equivalent experience.

  • Strong communication skills and the ability to work closely with ML researchers and engineers to understand their workflows and translate them into robust systems.

  • Experience designing and building data versioning, artifact management, or dataset lineage systems (e.g., DVC, LakeFS, Weights & Biases, or custom solutions).

  • Hands-on experience with ML pipeline orchestration tools (e.g., Airflow, Prefect, Metaflow, or similar).

  • Experience with model serving and inference optimization — profiling latency, reducing memory footprint, or scaling serving infrastructure to meet real-time constraints.

  • Ability to read and refactor ML training code — you don't need to design model architectures, but you need to understand what training pipelines are doing well enough to make them reliable.

  • Proficient with Python, PyTorch.

Bonus qualifications
  • Familiarity with AWS infrastructure services.

  • Experience with containerized ML workflows and GPU-accelerated training environments.

  • Experience with model optimization techniques (e.g., quantization, TensorRT, ONNX Runtime, distillation).

  • Knowledge of infrastructure-as-code tools (e.g., AWS CDK, Terraform).

  • Experience building or operating ML systems that handle large unstructured datasets (imagery, 3D data, sensor data).

Skills Required

  • 3+ years of experience in relevant fields
  • Bachelor's or Master's degree in Computer Science, Engineering, or equivalent
  • Experience designing data versioning or artifact management systems
  • Hands-on experience with ML pipeline orchestration tools
  • Experience with model serving and inference optimization
  • Ability to read and refactor ML training code
  • Proficient with Python and PyTorch
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
24 Employees
Year Founded: 2021

What We Do

Mach9 is at the forefront of leveraging advanced machine learning and computer vision techniques to transform raw geospatial data into actionable insights to help civil engineers build and maintain infrastructure globally. Our first product, Mach9 Digital Surveyor, helps surveyors automatically extract features from large-scale imagery and 3D datasets over 30x faster than today's manual and labor-intensive drafting workflows, accelerating the development of cost-effective and sustainable transportation and utility infrastructure. Mach9 supports leading asset owners and engineering and construction organizations globally solve the toughest engineering design, mission planning, and asset management problems.

Similar Jobs

Whatnot Logo Whatnot

Software Engineer

eCommerce • Mobile • Retail
In-Office
4 Locations
1200 Employees
190K-300K Annually
In-Office
2 Locations
2359 Employees
170K-216K Annually
In-Office
Santa Clara, CA, USA
471 Employees

Slickdeals Logo Slickdeals

Infrastructure Engineer

Consumer Web • Digital Media • eCommerce
Hybrid
San Mateo, CA, USA
156 Employees
170K-220K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account