The Role
Lead design and governance of enterprise MLOps platforms on AWS, implement CI/CD, monitoring, model versioning, scalability, security, cost optimization, and mentor MLOps engineers while aligning platform strategy with business needs.
Summary Generated by Built In
Lead MLOps Architect – AWS
Experience: 10–12 Years
Location: Egypt (Onsite Role)
Employment Type: Full-Time
We are seeking a highly experienced Lead MLOps Architect with deep AWS expertise to lead the design, architecture, and governance of enterprise-grade ML platforms. This role requires strong leadership capabilities, hands-on expertise in scalable ML systems, and experience managing large production environments.
Key Responsibilities- Architect and lead enterprise-scale MLOps platforms on AWS
- Define best practices for ML lifecycle management, deployment standards, and governance
- Lead production deployment of ML models using AWS-native services
- Design automated CI/CD pipelines for ML workflows and infrastructure
- Implement advanced monitoring, drift detection, retraining automation, and observability
- Ensure high availability, scalability, security, and cost optimization
- Establish model versioning, reproducibility, and experiment tracking standards
- Lead troubleshooting of complex production issues
- Mentor and lead a team of MLOps and platform engineers
- Collaborate with stakeholders to align ML platform strategy with business objectives
- 10–12 years of overall experience with strong focus on ML production systems
- Proven experience leading ML platform architecture and large-scale deployments
- Deep understanding of ML lifecycle management, governance, and reproducibility
- Hands-on experience with TensorFlow, PyTorch, Scikit-learn
- Strong experience with MLflow or enterprise model management tools
- Advanced hands-on expertise in:
- Amazon SageMaker (training, pipelines, endpoints)
- S3, EC2, Lambda
- ECR, ECS, EKS
- IAM, CloudWatch
- Experience designing secure, compliant, and scalable ML architectures
- Experience implementing cost optimization strategies on AWS
- Strong expertise in Docker and Kubernetes (EKS)
- Advanced CI/CD implementation
- Infrastructure as Code using Terraform and/or CloudFormation
- Experience implementing GitOps practices
- Expert-level Python skills
- Experience designing robust data pipelines
- Strong understanding of SQL/NoSQL systems
- Exposure to streaming or real-time ML systems
- AWS Professional-level certifications
- Experience with ML security, explainability, and regulatory compliance
- Experience building enterprise feature stores
- Exposure to real-time inference systems
Top Skills
Tensorflow,Pytorch,Scikit-Learn,Mlflow,Amazon Sagemaker,S3,Ec2,Lambda,Ecr,Ecs,Eks,Iam,Cloudwatch,Docker,Kubernetes,Terraform,Cloudformation,Gitops,Python,Sql,Nosql
Am I A Good Fit?
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.
Success! Refresh the page to see how your skills align with this role.
The Company
What We Do
NorthBay is an AWS Premier Partner focused on Database & Application migrations, data & analytics, DevOps & DataOps, application modernization and ML/Ai.
Our practice areas include big data and analytics, machine learning, artificial intelligence and database migrations.








