Senior DevOps / ML Infrastructure Engineer - AI Lab

Posted 3 Days Ago
Be an Early Applicant
5 Locations
In-Office or Remote
Senior level
Other
The Role
As a Senior DevOps/ML Infrastructure Engineer, you'll manage infrastructure, support ML model integration, and build automated MLOps pipelines in a collaborative setting.
Summary Generated by Built In
Secure Global Money Transfers with Cutting-Edge Technology. 

Join our mission to protect cross-border transactions, helping customers send money safely worldwide.

As a Senior DevOps / ML Infrastructure Engineer in our AI Lab, you'll maintain and scale our infrastructure while enabling seamless ML model integration into production workflows.

You'll work alongside our Senior MLOps Architect to build a comprehensive ML platform that serves multiple teams across the organization.

What You'll Do:

  • Manage multiple orchestration platforms: Kubernetes in AWS (CloudFormation) and on-prem Kubernetes clusters-
  • Maintain Apache Flink infrastructure (managed in AWS or self-hosted in on-prem Kubernetes)
  • Handle production support, incident response, and on-call rotations
  • Perform regular patching activities and security vulnerability remediation
  • Support and maintain workflow engine infrastructure
  • Improve observability by utilizing Prometheus, Grafana, Splunk, Slack alerts, etc.

MLOps & Platform Development:

  • Collaborate with Senior MLOps Architect to build and maintain ML infrastructure
  • Set up and configure MLflow for experiment tracking and model registry
  • Build automated MLOps pipelines for model training, experimentation, and deployment (Champion-Challenger, shadow mode)
  • Support feature calculation pipelines and ETL processes
  • Enable model serving infrastructure for Python-based ML services

We're Looking For:

  • 3-5+ years of professional experience in DevOps or infrastructure engineering
  • Strong hands-on experience with AWS services (EKS, ECR, SQS, S3, Managed Kafka, Managed Prometheus)
  • Deep experience with Kubernetes in production environments (multi-cluster management is a plus)
  • Proficiency with infrastructure as code: AWS CloudFormation and CDK (AWS Cloud Development Kit)
  • Experience with containerization (Docker) and container orchestration
  • Knowledge of setting up and maintaining CI/CD pipelines (GitHub Actions, ArgoCD, Jenkins, etc.)
  • Hands-on experience with observability tools: Prometheus, Grafana, Splunk- Experience with production support, incident response, and on-call rotations
  • Strong communication skills (English B2+)
  • Ability to work collaboratively with cross-functional teams (MLOps engineers, data scientists, software engineers)

It would be a plus:

  • Experience with Apache Flink, Kafka, or other stream processing frameworks
  • Understanding of ML lifecycle: model training, evaluation, deployment patterns
  • Experience with workflow engines or rule engines
  • Knowledge of fraud prevention, fintech, or compliance domains
  • Understanding of feature stores, ETL pipelines, and data engineering concepts

What We Offer:

  • Remote work flexibility – work from anywhere- B2B contract with competitive gross compensation in USD
  • Top-tier hardware to support your productivity
  • A challenging role in a team of skilled professionals with opportunity to grow into MLOps specialization
  • Direct collaboration with Senior MLOps Architect to learn and contribute to ML platform development
  • Continuous learning and career growth opportunities
  • Coverage for professional development: training, seminars, and conferences
  • Access to high-quality English lessons
  • Impact: Your work will directly prevent fraud while enabling secure financial access globally

Why This Role:

This position offers a unique opportunity to work at the intersection of traditional DevOps and MLOps. You'll maintain critical infrastructure while building expertise in ML infrastructure, model deployment, and workflow integration. You'll complement our MLOps Architect by handling general infrastructure needs while growing your ML platform skills, ultimately enabling faster delivery of ML capabilities across the organization.

Top Skills

Apache Flink
Argocd
AWS
Docker
Github Actions
Grafana
Jenkins
Kubernetes
Mlflow
Prometheus
Splunk
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Newark, NJ
1,151 Employees
Year Founded: 1990

What We Do

We know that you’ll have looked at quite a few company pages but IDT is different. We want people who want to make a big difference to our company with big ambitions. We’re a truly global team, with 1300 people working across all continents, apart from (at the moment!) Antarctica. But we are proud that despite our size, we encourage and support any in-house entrepreneurs to develop their ideas into business action. Our exciting growth plans make it a great time to join us.

Our people are the reason for IDT’s passion for success. The IDT family is made up of people of all backgrounds, expertise, and interests, all with a relentless team spirit. We need people who share both our commitment to success and excitement about our journey. You won’t ever be bored or have to wonder how to fill your time. You’ll find the work challenging but you’ll get the support of a great team to help you beat those challenges. You will also be expected to support others as well as work hard, work well and work with a smile.

If you want a join a company that will help you become your brilliant best and achieve amazing results, then you want to join IDT.

Similar Jobs

ServiceNow Logo ServiceNow

Director, Renewal Sales

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
28000 Employees
10-10 Annually

ServiceNow Logo ServiceNow

Architect

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
28000 Employees

Tulip Logo Tulip

Solutions Engineer

Enterprise Web • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
28 Locations
310 Employees

GitLab Logo GitLab

Back-end Engineer

Cloud • Security • Software • Cybersecurity • Automation
Easy Apply
In-Office or Remote
33 Locations
2500 Employees

Similar Companies Hiring

Spark Advisors Thumbnail
Software • Sales • Other • Insurance • Healthtech
New York, NY
110 Employees
Cox Enterprises Thumbnail
Software • Other • Information Technology • Greentech • Cybersecurity • Cloud • Automotive
Atlanta, GA
50000 Employees
Compa Thumbnail
Software • Other • HR Tech • Business Intelligence • Artificial Intelligence
Irvine, CA
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account