10a Labs

Dev Ops / Infrastructure Engineer - 3 Month Contract

Reposted Yesterday

Easy Apply

New York City, NY

In-Office

Mid level

Artificial Intelligence • Analytics • Consulting • Cybersecurity

Research Services

The Role

Responsible for deploying, monitoring, and scaling ML-powered content moderation systems. Tasks include building CI/CD pipelines and implementing observability tooling.

Summary Generated by Built In

About 10a Labs: 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely.

3 Month Contract | Remote | High-Impact

About the Role: We’re looking for an infrastructure-focused devOps / MLOps engineer who thrives at the intersection of machine learning, systems, and product delivery. This is a hands-on 3-month contract role responsible for deploying, monitoring, and scaling the testing and deployment infrastructure for a real-time ML-powered content moderation system used to detect and triage abuse, threats, and edge-case language.

In This Role, You Will:

Design, build, and document a maintainable GCP cloud infrastructure CI/CD pipeline for real-time model serving and data workflows
Deploy and optimize APIs for low-latency ML systems
Automate model deployment, retraining, and evaluation (CI/CD for ML)
Build observability tooling to monitor rollouts, errors, integration testing, and drift in ML pipelines
Ensure infrastructure meets security, compliance, and uptime requirements

We’re Looking for Someone Who:

Has 3–8 years of DevOps/Platform engineering experience deploying machine learning systems or high-availability backend systems.
Ability to build CI/CD pipelines from scratch; familiarity with GitHub Actions or similar.
Expert-level proficiency with Git and GitHub workflows and strong scripting abilities in Python, Bash, and/or Go.
Experience with Google Cloud Run and Docker. Experience with Google Cloud Platforms, Docker, Kubernetes, Terraform .
Familiarity with SOC 2 compliance requirements and security best practices (IAM, secrets, etc).
Experience implementing monitoring, logging, and alerting systems (e.g., Prometheius, Grafana, ELK/EFK, OpenTelemetry).
Can work cross-functionally with ML, security, and engineering teams to deploy safely and iterate fast.
Brings a builder's mindset and bias for ownership in ambiguous environments.

What Success Looks Like in the 3 Months:

You’ve deployed and monitored a real-time ML inference system with well-defined observability.
You’ve implemented an API with latency under 1000ms for classifier-based inference.
You’ve partnered with ML engineers to streamline deployment and retraining workflows.
You’ve built logging and monitoring that gives insight into system performance and classifier behavior.

Top Skills

Bash

Docker

Elk

GCP

Github Actions

Grafana

Kubernetes

Opentelemetry

Prometheus

Python

Terraform

View all jobs at 10a Labs

View 10a Labs Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

28 Employees

What We Do

10a Labs is an applied research and technology company specializing in AI security. We deliver intelligence collection, investigative research, and analysis for AI unicorns, Fortune 10 companies, and U.S. tech leaders.