L3 Support Engineer (Full Stack Support)

Reposted 14 Days Ago
Be an Early Applicant
Hiring Remotely in Pakistan
Remote or Hybrid
Senior level
Information Technology • Software
The Role
The L3 Support Engineer is responsible for managing high-severity production incidents, improving AI-driven workflows, mentoring junior engineers, and ensuring system reliability.
Summary Generated by Built In

Job Title
L3 Support Engineer – Agentic AI, Automation & Reliability (Full‑Stack Support)

Role Overview

As an L3 Support Engineer – Agentic AI, Automation & Reliability, you will play a critical role in ensuring the stability, performance, and continuous improvement of AIM’s cloud‑based and distributed systems. Operating as a senior escalation point, you will own high‑severity (P1/P2) production incidents end to end—driving rapid troubleshooting, remediation, root cause analysis, and long‑term prevention across applications, integrations, and cloud infrastructure.

This role goes beyond traditional support. You will actively design, operate, and improve AI‑driven and automated support workflows, including agent‑based ticket triage, LLM‑assisted diagnostics, and self‑healing runbooks. Working closely with global teams and North American stakeholders, you will combine deep technical expertise with strong communication skills to lead major incident bridges, produce clear RCAs, and mentor L1/L2 engineers in adopting automation‑first and AI‑assisted operating practices.

Location
Remote (Pakistan)

Work Hours
8:00 AM – 5:00 PM Eastern Time, with participation in a global on‑call rotation for critical incidents.

About AIM 

AIM is a Canadian technology company that helps organizations modernize their systems through advanced API management, cloud engineering, security solutions, and full-stack software development. Our teams work across North America and globally, delivering stable, scalable, and secure digital platforms for enterprise clients.

We take pride in being hands-on, collaborative, and focused on delivering real results for our clients. As we grow, we are expanding our marketing team to strengthen our brand presence and support our next stage of growth.

Core Technical Skills

  • Strong troubleshooting skills across applications, infrastructure, and integrations, with ownership of P1/P2 incidents end‑to‑end (detection, mitigation, RCA, and prevention).
  • Solid understanding and practical application of ITIL processes (Incident, Problem, Change Management) in an ITSM tool such as Jira Service Management, ServiceNow, or ManageEngine.
  • Scripting and automation skills in at least one of: Python (preferred), PowerShell, or Bash, with examples of automating repetitive operational tasks (ticket handling, health checks, log analysis, etc.).
  • Experience working with APIs (REST, Graph API) and integrating systems and workflows using APIs and webhooks.
  • Working knowledge of a major cloud platform, preferably Microsoft Azure (compute, storage, networking, identity, monitoring/alerts). Experience with AWS or GCP is acceptable if you are willing to ramp up on Azure.

Agentic AI & Automation Skills

Must‑Have

  • Practical experience designing, configuring, or operating AI‑driven or agent‑based workflows (e.g., autonomous ticket triage, virtual agents, or LLM‑assisted runbooks).
  • Understanding of prompt engineering basics, how AI agents call tools/APIs, and how context/memory is managed in such systems.
  • Awareness of AI risks (hallucinations, unsafe actions) and how to implement guardrails, human‑in‑the‑loop controls, and governance policies.

Nice‑to‑Have

  • Familiarity with Retrieval‑Augmented Generation (RAG), vector databases, semantic search, or multi‑agent orchestration frameworks.

Technology Stack (Exposure Expected)

  • Cloud: Microsoft Azure (preferred), and/or AWS/GCP.
  • ITSM: Jira Service Management (preferred), ManageEngine, ServiceNow, or similar.
  • Observability: Azure Monitor, Datadog, Splunk, Prometheus, or equivalent tools for logs, metrics, traces, and alerting.
  • Bonus: Knowledge of containers and orchestration (Docker, Kubernetes) is an asset but not mandatory.

Soft Skills & Operating Expectations

  • Excellent written and verbal English communication, able to lead major incident bridges and produce clear incident reports and RCAs for North American stakeholders.
  • Strong ownership mindset; comfortable operating across L1/L2/L3 when needed, while driving automation and self‑healing to reduce manual workload.
  • Ability to mentor L1/L2 engineers in using AI‑driven tools and adopting automation‑first practices.
  • Comfortable working permanently 9–5 EST from Pakistan and participating in an on‑call rotation for after‑hours incidents as part of a global support model.

Minimum Experience

  • 5–8 years in Production Support, Support Engineering, or Site Reliability Engineering, including at least 3 years handling L2/L3 escalations in cloud or distributed systems.
  • Proven experience working with international customers (North America or Europe) and operating in shift‑based or evening/night schedules.
  • Hands‑on experience in environments where AI‑driven or automated workflows are used for support, operations, or reliability.

Preferred Qualifications

  • AZ-104 Microsoft Certified: Azure Administrator Associate
  • AI-103 Microsoft Certified: Azure AI Apps and Agents Developer Associate certification
  • AZ-700 Microsoft Certified: Azure Network Engineer Associate
  • SC-401 Microsoft Certified: Information Security Administrator Associate

Preferred Qualifications

  • AZ-305 Microsoft Certified: Azure Solutions Architect Expert
  • Certifications in ITIL, Azure/AWS/GCP, or AI/ML disciplines.
  • Experience in managed services or SaaS environments with multi‑tenant architectures.
  • Familiarity with compliance and security frameworks such as SOC 2 and ISO 27001.

Skills Required

  • 5-8 years in Production Support, Support Engineering, or Site Reliability Engineering
  • 3 years handling L2/L3 escalations in cloud or distributed systems
  • Experience working with international customers
  • Hands-on experience in AI-driven or automated workflows
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Oakville, ON
46 Employees
Year Founded: 2006

What We Do

AIM is a specialized company providing a wide range of API management solutions and services to ensure optimal performance and security for businesses. Key offerings include API Health Check, which addresses vulnerabilities and potential improvements; Managed Support Services for active API management software maintenance; and Dedicated API Expert Support for personalized assistance. AIM also offers Certified Trainers for API strategy, design, and management, as well as Custom Approach to Training tailored to specific business needs. To maximize ROI, AIM provides Upgrade Service, assisting teams in transitioning to the latest product versions with training and knowledge transfer. With a proactive approach, AIM offers 24/7 infrastructure monitoring and operational support during business hours. Other services include Native Monitoring with a log analyzer for security threats, Threat and Risk Assessments (TRA) for enterprise-wide and system-specific evaluations, Technical Vulnerability Assessment and Penetration Testing for network infrastructure, computing layer, and application layer assessments, and Information Security Health Check for evaluating critical security elements and providing improvement recommendations. By offering a comprehensive suite of services tailored to the unique needs of businesses relying on API management solutions, AIM helps organizations stay competitive and secure in the fast-paced digital economy.

Similar Jobs

Octus Logo Octus

Team Lead

Fintech • News + Entertainment • Software • Database • Financial Services
Easy Apply
Remote or Hybrid
Pakistan
808 Employees

Circle (circle.so) Logo Circle (circle.so)

Lead Product Designer

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Easy Apply
Remote
31 Locations
250 Employees
140K-170K Annually

Motive Logo Motive

Account Manager

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Easy Apply
Remote
Pakistan
4000 Employees

Motive Logo Motive

Implementation/Installation Strategist

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Easy Apply
Remote
Pakistan
4000 Employees

Similar Companies Hiring

Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account