Platform SRE and Reliability Engineer

Posted 17 Days Ago
Be an Early Applicant
Abu Dhabi
In-Office
Senior level
Artificial Intelligence • Information Technology • Software
The Role
Design and implement automated SRE and AI QA frameworks to validate conversational AI, RAG pipelines, and banking microservices. Integrate reliability testing into CI/CD for MLOps, perform load and failover testing across Azure and AWS, and enforce data integrity, fairness, and compliance. Act as technical liaison translating reliability requirements into business-facing quality narratives.
Summary Generated by Built In

DeepLight AI is a specialist AI and data consultancy with extensive experience implementing intelligent enterprise systems across multiple industries, with particular depth in financial services and banking. Our team combines deep expertise in data science, statistical modeling, AI/ML technologies, workflow automation, and systems integration with a practical understanding of complex business operations.

The Platform SRE and Reliability Engineer is responsible for ensuring the absolute quality, resilience, and performance of the Bank’s next-generation AI and digital platforms. This role focuses on the high-stakes intersection of Site Reliability Engineering (SRE) and AI Quality Assurance, designing automated frameworks to validate everything from Conversational AI agents and RAG pipelines to core banking microservices. By implementing robust continuous testing pipelines and reliability governance, you will guarantee that the Bank’s AI-driven experiences remain secure, scalable, and deterministically accurate under real-world conditions.

As the Platform SRE & Reliability Engineer, your responsibilities include:

  • Building reusable automation frameworks to test the accuracy, stability, latency, and safety of Conversational AI platforms (voice and chat) and LLM-based agents.
  • Validating multi-agent orchestration, human-in-the-loop escalation logic, and the integrity of RAG pipelines and vector search results.
  • Testing AI/ML platform components for scaling behavior, failover resilience, high availability, and disaster recovery.
  • Integrating automated test pipelines into CI/CD workflows for MLOps, focusing on drift detection, retraining validation, and model registry integrity.
  • Verifying AI/ML pipelines on Azure AI Foundry and AWS SageMaker, ensuring data integrity across storage services (S3/Blobs) and serverless functions.
  • Conducting load testing for AI services and ensure engineering guardrails for fairness, explainability, and regulatory compliance are enforced.
  • Acting as a bridge between engineering and business, translating complex technical reliability requirements into actionable quality narratives.

As an AI consultancy, our greatest asset is the expertise of our people.


While technical mastery is the foundation of what we do, the ability to bridge the gap between complex data science and actionable business value is what defines your success with Deeplight.


We're looking for individuals who are not only world-class in their fields of specialism, but also compelling communicators and persuasive advocates for their own skills.


You will be the face of our firm, tasked with building trust, articulating the "why" behind your technical decisions, and effectively "selling" your vision to high-level stakeholders.


If you thrive on the challenge of presenting cutting-edge solutions as much as you do on building them, you will fit right in.


Requirements

To be successful in this role, we need you to have:

  • A Bachelor’s degree in Computer Science, AI, Software Engineering, or a related quantitative field. A Master’s degree in AI/ML is highly preferred.
  • 5+ years in QA, Application Testing, or Reliability Engineering, ideally for a large-scale brand or digital-only bank.
  • Proven track record in deploying AI/ML QA solutions at an enterprise scale within the financial services sector.
  • Experience testing distributed architectures, microservices, and large-scale data platforms (Vector DBs, Data Lakes).
  • Expertise in Python-based automation frameworks and tools such as Selenium, Playwright, PyTest, JMeter, and Locust.
  • A deep understanding of LLM evaluation frameworks, prompt stability testing, and hallucination avoidance validation.
  • Hands-on experience testing and validating services across both Azure and AWS cloud environments.
  • Strong SQL/NoSQL validation skills (Postgres, MongoDB) and experience testing REST, GraphQL, and FastAPI integrations.
  • Be proficient in testing within Docker and Kubernetes (EKS/AKS) environments.


It would be beneficial if you also had:

  • An ability to evaluate and adopt emerging QA tools for AI frameworks like LangChain, CrewAI, and Bedrock.
  • An understanding of cutting-edge quality trends, including multimodal QA and RLHF (Reinforcement Learning from Human Feedback) output evaluation.
  • A proactive approach to identifying edge cases in AI agents that could impact banking compliance or customer experience.
  • A strong ability to coordinate with different functional teams to implement models and monitor outcomes.

Benefits

Benefits & Growth Opportunities:

  • Competitive salary.
  • Visa Sponsorship for the successful individual.
  • Comprehensive health insurance for the successful individual.
  • Professional development and certification support.
  • Opportunity to work on cutting-edge AI projects.
  • Career advancement opportunities in a rapidly growing AI company.

This position offers a unique opportunity to shape the future of AI implementation while working with a talented team of professionals at the forefront of technological innovation. The successful candidate will play a crucial role in driving our company's success in delivering transformative AI solutions to our clients.

At DeepLight AI, we recognise that diversity drives innovation. We are committed to fostering an inclusive environment where individuals with different thinking styles can thrive and contribute their unique strengths to our specialised AI and data solutions.

Our goal is to ensure our application and interview process is accessible, predictable, and fair for all candidates.

If you require any specific adjustments to the application process, or if you require any reasonable adjustments should you be successful in being processed to the interview stage, please do let us know. This information will be kept strictly confidential and will not impact hiring decisions.

Top Skills

Python,Selenium,Playwright,Pytest,Jmeter,Locust,Azure Ai Foundry,Aws Sagemaker,S3,Azure Blob Storage,Vector Dbs,Data Lakes,Postgres,Mongodb,Rest,Graphql,Fastapi,Docker,Kubernetes,Eks,Aks,Langchain,Crewai,Bedrock,Rlhf,Rag,Llm,Conversational Ai
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Dubai
19 Employees

What We Do

Welcome to Deeplight, the pioneering consultancy dedicated to guiding businesses through every facet of the AI adoption journey. From initial industry awareness to seamless integration and optimization, Deeplight is your strategic partner for unlocking the full potential of Artificial Intelligence.

The Enterprise Innovation Roadmap: Navigating Your AI Journey

EXECUTIVE COACHING: Propel your leadership team to new heights with personalized coaching driven by AI insights and guidance. Deeplight ensures your leadership makes informed decisions and drives innovation at the highest level.

CURRENT STATE ASSESSMENT: Embark on your AI journey with clarity by conducting a data-driven assessment. Gain a comprehensive understanding of your organization's AI readiness, identifying key strengths, weaknesses, and opportunities to maximize your return on AI investment.

DATA READINESS INITIATIVES: Elevate your data game with Deeplight's tailored data cleansing, organization. Bridge the gap between your current data and the needs of successful AI implementation, ensuring your AI models have the fuel they need to thrive.

USE CASE WORKSHOPS: Collaboratively explore the most impactful AI applications for your unique business needs. Deeplight facilitates brainstorming sessions, feasibility studies, and prioritization to identify use cases with the highest potential for success. Turn ideas into actionable strategies with our expert guidance.

CUSTOM MODEL DESIGN: Deeplight goes beyond basic AI models with custom "AiGent" design. Craft intelligent agents tailored to your challenges and goals, ensuring optimal performance and alignment with your strategic vision. Redefine what's possible with AI for your business.

Ready to elevate your business through the power of AI?

Deeplight serves as your beacon of innovation, guiding businesses through the complex process of AI adoption. Connect with us to embark on a transformative AI journey that propels your enterprise to new heights.

Similar Jobs

Immersive Logo Immersive

Consultant

Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
Remote or Hybrid
UAE
330 Employees

Immersive Logo Immersive

Cyber Resilience Advisor - Dubai, UAE

Enterprise Web • HR Tech • Information Technology • Software • Cybersecurity
Remote or Hybrid
UAE
330 Employees

CrowdStrike Logo CrowdStrike

Sales Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
UAE
10000 Employees

CrowdStrike Logo CrowdStrike

Account Executive

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
UAE
10000 Employees

Similar Companies Hiring

Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY
Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account