Senior Machine Learning Engineer (Ops)

Reposted 7 Days Ago
Easy Apply
Be an Early Applicant
Hiring Remotely in Hyderabad, Telangana, IND
In-Office or Remote
6-6 Annually
Senior level
Artificial Intelligence • Machine Learning • Robotics • Software
The Role
The role involves owning the MLOps infrastructure, migrating detection pipelines, building CI/CD pipelines, and ensuring system reliability in production.
Summary Generated by Built In

About Us

Are you ready to build the future of supply chain? At Gather AI, we're not just creating software; we're pioneering a new era of warehouse intelligence. We've developed a groundbreaking, vision-powered platform that uses autonomous drones and existing equipment to capture real-time data, completely digitizing workflows that have historically been manual and error-prone. This means facilities operate smarter, safer, and more efficiently, ultimately redefining "on-time, in full" delivery.

If you're looking for an opportunity to contribute to truly transformative technology and make a significant impact in a vital industry, Gather AI is the place for you. We're leading the charge in the rapidly evolving robotics industry, and we invite you to join us in reshaping the global supply chain, one intelligent warehouse at a time.

About the Team

Our Engineering team builds the systems that turn cutting-edge ML research into reliable, production-grade infrastructure. We operate at the intersection of machine learning, cloud infrastructure, and real-world logistics — shipping software that directly impacts how goods move around the world. We value operational excellence, first-principles thinking, and the kind of engineering that makes complex systems look effortless.

About the Role

We're looking for a Senior ML Engineer (Ops) to own the infrastructure engine behind Gather's machine learning platform. This is a hands-on, high-ownership role for someone who thrives on the "last mile" problem — ensuring that sophisticated vision models don't just work in research, but run reliably at scale in production environments.

You'll be the primary builder and maintainer of our MLOps platform, leading the transition from manual, ad-hoc deployments to a fully automated, enterprise-grade system. If you're energized by building things that actually ship — and that deliver measurable, physical-world impact — this role was designed for you.

What You'll Do

  • Migrate box and barcode detection pipelines to cloud infrastructure following MLOps best practices
  • Build and maintain CI/CD pipelines for deployment across production and non-production environments
  • Implement automated rollback, canary, and blue-green deployment strategies for ML microservices
  • Build out a multi-tenant MLOps platform using tools like Prefect, ZenML, or similar orchestration frameworks
  • Establish a centralized model registry and versioning system for all production assets
  • Instrument observability across the ML stack — logging, metrics, and distributed tracing — to ensure reliability at scale

What You'll Need

  • 6+ years of industry experience (outside academia) in ML engineering, MLOps, or infrastructure engineering
  • Deep operational fluency with Kubernetes and Docker for ML workload orchestration
  • Strong production-grade Python skills with a track record of hardening research code into scalable microservices
  • Hands-on experience with CI/CD for ML (e.g., GitHub Actions, GitLab CI) and model serving frameworks (e.g., KServe, SageMaker, Vertex AI Endpoints)
  • Experience with pipeline orchestration and model lifecycle tools such as Airflow, MLflow, Kubeflow, or Flyte
  • Proven ownership of production system reliability, including SRE principles, observability stacks, and automated failure safeguards

Nice to Have

  • Prior experience building end-to-end MLOps pipelines (data, model, and inference) from scratch
  • Domain experience in logistics, supply chain, or robotics-adjacent cloud platforms
  • Familiarity with feature stores and training/serving data consistency patterns
  • Experience with Infrastructure as Code tools such as Terraform

Top Skills

Airflow
Ci/Cd
Docker
Flyte
Github Actions
Gitlab Ci
Kserve
Kubeflow
Kubernetes
Mlflow
Prefect
Python
Sagemaker
Terraform
Vertex Ai
Zenml
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Pittsburgh, PA
27 Employees
Year Founded: 2019

What We Do

The world's first software-only autonomous inventory management platform for modern warehouses.

Similar Jobs

Circle (circle.so) Logo Circle (circle.so)

Customer Success Manager

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
110K-120K Annually

Bounteous Logo Bounteous

Manager, Global Compliance and Corporate Governance

Agency • Digital Media • eCommerce • Professional Services • Software • Analytics • Consulting
Remote or Hybrid
India
5000 Employees
10-10 Annually

Celonis Logo Celonis

Territory Partner Manager (Mumbai)

Big Data • Information Technology • Productivity • Software • Analytics • Business Intelligence • Consulting
Remote or Hybrid
India
3000 Employees

MetLife Logo MetLife

Data Scientist

Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Remote or Hybrid
India
43000 Employees

Similar Companies Hiring

Fairly Even Thumbnail
Hardware • Other • Robotics • Sales • Software • Hospitality
New York, NY
30 Employees
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account