MLOps Engineer — AI/ML Systems & Deployment (TS/SCI Preferred)

Reposted 5 Days Ago
Dayton, OH, USA
In-Office
Mid level
Artificial Intelligence • Cloud • Machine Learning
The Role
The MLOps Engineer will manage the ML system lifecycle, deploy models in mission-critical environments, and ensure reliability through monitoring and infrastructure optimization. They will work with cloud-native technologies and focus on operationalizing AI/ML capabilities within classified settings.
Summary Generated by Built In

MLOps Engineer — AI/ML Systems & Deployment (TS/SCI Preferred)
Dayton, OH (On-site Preferred) | Remote Eligible (U.S.-based, Clearance-Ready)
Clearance-Eligible Role | Mission-Critical AI/ML Systems

About the Role

At Rackner, we build systems where advanced technologies move beyond prototypes and into real-world operational use.

We are seeking an MLOps Engineer to support the deployment and lifecycle management of AI/ML systems within a secure, mission-focused environment.

This is not a research role.

This is where models become reliable, deployable, and auditable systems.

You will operate at the intersection of:

  • machine learning
  • cloud-native infrastructure
  • distributed systems

…and ensure AI/ML systems are production-ready in environments where reliability and performance matter.

What You’ll Do

Own the ML Lifecycle (End-to-End)

  • Build and operate production-grade ML pipelines
  • Orchestrate workflows using Kubeflow, Airflow, or Argo
  • Implement model versioning, lineage, and reproducibility standards

Operationalize AI/ML Systems

  • Deploy models into secure and constrained environments
    Transition workflows from experimentation → containerized pipelines → production systems
    Enable both batch and real-time inference architectures

Engineer for Reliability

  • Design systems for reproducibility, auditability, and stability
  • Monitor model performance and system health using Prometheus, Grafana, OpenTelemetry
  • Detect and resolve issues such as model drift and system degradation

Build Cloud-Native ML Infrastructure

  • Deploy and manage Kubernetes-based ML workloads
  • Containerize pipelines using Docker
  • Support scalable training and inference workflows

Establish Data Discipline

  • Support feature engineering and dataset preparation
  • Implement data versioning and governance practices (e.g., lakeFS)
  • Apply metadata and data management standards

Create Repeatable Systems

  • Develop runbooks, playbooks, and documentation
  • Build systems that are operationally sustainable and transferable

What You Bring

Core Experience

  • Experience deploying ML systems into production environments
  • Strong programming skills in Python
  • Hands-on experience with:
    • ML pipeline tools (Kubeflow, Airflow, Argo)
    • Experiment tracking tools (MLflow, ClearML)

Infrastructure & Systems

  • Experience with Kubernetes and containerized systems (Docker)
  • Familiarity with CI/CD pipelines
  • Understanding of distributed systems and scalable architectures

ML Application Exposure

  • Experience working with:
    • LLMs or transformer-based models
    • Computer vision systems (YOLO, Faster R-CNN)
  • Focus on deployment and integration, not pure research

Mindset

  • Systems thinker who prioritizes reliability over novelty
  • Comfortable operating in complex, evolving environments
  • Focused on delivering real-world outcomes

Clearance Requirements

  • Active TS/SCI clearance strongly preferred
  • Candidates with an active Secret clearance may be considered and supported for upgrade
  • Candidates without an active clearance must be:
    • U.S. citizens
    • eligible to obtain and maintain a clearance
    • able to work in a CAC-enabled or secure environment

Note: Start timelines and work scope may vary depending on clearance status and program requirements

Why This Role Matters (What You Get)

This role is a career accelerator for engineers who want to:

  • Move beyond experimentation and own production systems
  • Work across ML, infrastructure, and deployment pipelines
  • Build in high-trust, secure environments
  • Develop high-demand MLOps expertise in constrained systems
  • Deliver systems that are used, not just built

Who We Are

Rackner is a software consultancy that builds cloud-native solutions for startups, enterprises, and the public sector. We are an energetic, growing team focused on solving complex problems through:

  • Distributed systems
  • DevSecOps
  • AI/ML
  • Cloud-native architecture

Our approach is cloud-first, cost-effective, and outcome-driven, delivering systems that scale and perform in real-world environments.

Benefits & Perks

  • 100% covered certifications & training aligned to your role
  • 401(k) with 100% match up to 6%
  • Highly competitive PTO
  • Comprehensive Medical, Dental, Vision coverage
  • Life Insurance + Short & Long-Term Disability
  • Home office & equipment plan
  • Industry-leading weekly pay schedule

Apply

If you’re an engineer who wants to move from building models → owning production systems, we’d like to connect.


#MLOps #MachineLearning #Kubernetes #AIEngineering #CloudNative #DevSecOps #ArtificialIntelligence #DataEngineering #DefenseTech #NationalSecurity #AIInfrastructure #Hiring #TechCareers

Skills Required

  • Experience deploying ML systems into production environments
  • Strong background in Python and ML frameworks
  • Experience with ML pipeline orchestration tools
  • Experience with Kubernetes and containerized workloads
  • Familiarity with CI/CD for ML systems
  • Experience working with LLMs or transformer-based models
  • Experience with computer vision systems
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Silver Spring, MD
11 Employees
Year Founded: 2015

What We Do

Rackner builds cutting-edge solutions that apply DevSecOps and the power of AI in the datacenter, public and private clouds, and edge, leveraging the future of compute capability and technologies like Kubernetes (k8s) and WebAssembly (WASM). We're a member of the Cloud Native Computing Foundation and a Kubernetes Certified Service Provider - as well as a partner to the major public cloud companies. Our customers include hypergrowth startups and federal agencies, both Civilian and Defense. Core Competencies - DevSecOps - Edge Computing - AI/ML - Cloud-Native and Hybrid-Cloud development - Web and Mobile Applications Development (Microservices)

Similar Jobs

MetLife Logo MetLife

Customer Care Advocate Disability Intake - Omaha NE 8.3.26

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
United States
43000 Employees
20-20 Hourly

Zscaler Logo Zscaler

Director, Global CXO Programs

Cloud • Information Technology • Security • Software • Cybersecurity
Easy Apply
Remote or Hybrid
USA
8697 Employees
168K-240K Annually

Adstra Logo Adstra

Demand Generation Lead

AdTech • Big Data • Digital Media • Marketing Tech • Database • Automation
In-Office or Remote
2 Locations
175 Employees

PwC Logo PwC

SAP Technology Gen AI, Manager

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Hybrid
51 Locations
370000 Employees
99K-232K Annually

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York City, NY
100 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account