AI/ML Engineer III

Posted 2 Days Ago
Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, MYS
In-Office
Mid level
Information Technology
The Role
Design, build, and deploy AI/ML and GenAI solutions (LLMs and AI agents) for IT automation and AIOps. Lead solution ownership in Agile, develop/run machine learning models, integrate open-source and proprietary LLMs, build runbook executors (PowerShell/Python), work with cloud and container platforms, engage stakeholders, and document automation and models.
Summary Generated by Built In

Job Description

Responsibilities:

  • Contribute to the roadmap for AI-driven automation, aligning it with strategic company and client goals.
  • Collaborate with engineering and data teams to design and architect scalable, robust, and innovative AI solutions (e.g., automated network diagnostics, bot recommendation systems, AI agents for NOC operations).
  • Act as the Solution Owner in an Agile/Scrum environment, managing the product backlog, writing detailed user stories, defining acceptance criteria, and prioritizing features.
  • Lead the evaluation, fine-tuning, and integration of open-source Large Language Models (LLMs), such as the Llama series, and other LLMs.
  • Develop and own key components of the automation framework, including PowerShell and Python runbook executors.
  • Develop and train machine learning models for tasks such as anomaly detection, predictive maintenance, and capacity planning in IT environments.
  • Work with large datasets of IT operational data, performing data cleaning, feature engineering, and data analysis to improve model accuracy and performance.
  • Contribute to the development of internal automation tools and frameworks.
  • Deliver continuous service improvements by proactively identifying opportunities for process enhancements.
  • Develop AI/GenAI point solutions to meet business needs.
  • Troubleshoot and resolve issues related to automation systems and AI models.
  • Engage with customers, IT operators, Network Engineers, and internal stakeholders to gather requirements, validate solutions, and ensure product-market fit.
  • Serve as the Subject Matter Expert (SME) on AIOps, IT Process Automation (ITPA), and Runbook Automation (RBA), providing technical guidance and insights.
  • Document automation processes, code, and models clearly and concisely.
  • Actively contribute to team results and work towards achieving team goals and objectives.
  • Undertake designated skill/knowledge development within the organization, including training of next-level team members.

Required Skills & Competencies:

  • 4+ years of experience in IT services, with at least 3 years in:
    • IT Service Automation – Orchestration, Scripting, & Process Assessment.
    • AI Engineering – Developing and deploying AI Agents and cutting-edge GenAI & ML solutions to address complex business challenges.
  • Expertise in integrating various tools and building analytics/insights.
  • Proficiency in at least two scripting languages such as PowerShell, Python, or Shell Script.
  • Strong foundational knowledge across core IT domains, with a specific emphasis on Network & Network Services, including understanding network topologies, protocols, and common operational issues.
  • Demonstrable experience implementing solutions using state-of-the-art LLMs, with hands-on experience with both open-source models (e.g., Llama series) and proprietary models (e.g., GPT-4).
  • Proficiency with a major deep learning framework, preferably PyTorch, for model experimentation and fine-tuning.
  • Proficiency in a high-level programming language (Java, Python) and experience with containerization technologies (Docker, Kubernetes) for deploying AI models.
  • Solid understanding and practical experience with cloud platforms such as Azure or GCP, including knowledge of their networking services (e.g., VNets, VPCs).
  • Experience with AI/ML libraries and frameworks.
  • Strong understanding of ITIL processes in one or more service lifecycle or service capability modules.
  • Knowledge of one or more system administration activities, such as monitoring, service requests, incident management, change management, and maintenance.
  • Excellent communication skills with an ability to explain concepts and solutions clearly and concisely.
  • Excellent analytical and problem-solving skills.

Skills Required

  • 4+ years of experience in IT services, with at least 3 years in IT Service Automation and AI Engineering
  • Experience in IT Service Automation: orchestration, scripting, and process assessment
  • Experience developing and deploying AI agents, GenAI, and ML solutions
  • Expertise integrating tools and building analytics/insights
  • Proficiency in at least two scripting languages such as PowerShell, Python, or Shell Script
  • Strong foundational knowledge of network and network services, including topologies, protocols, and operational issues
  • Demonstrable experience with state-of-the-art LLMs, both open-source (e.g., Llama series) and proprietary (e.g., GPT-4)
  • Proficiency with a deep learning framework, preferably PyTorch, for experimentation and fine-tuning
  • Proficiency in a high-level programming language (Java or Python)
  • Experience with containerization technologies such as Docker and Kubernetes
  • Practical experience with cloud platforms (Azure or GCP) including networking services (VNets, VPCs)
  • Familiarity with AI/ML libraries and frameworks
  • Strong understanding of ITIL processes (one or more service lifecycle or capability modules)
  • Knowledge of system administrative activities: monitoring, service requests, incident management, change management, maintenance
  • Excellent communication, analytical, and problem-solving skills

Astreya Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Astreya and has not been reviewed or approved by Astreya.

  • Healthcare Strength: Health coverage includes multiple medical plan options plus dental and vision, complemented by FSAs, an EAP, disability and life insurance, and wellness programs. Feedback suggests these offerings provide solid core protection across many roles.
  • Wellbeing & Lifestyle Benefits: Client-site amenities at some large tech campuses can add non-cash value such as meals or on-site perks that enhance the day-to-day experience. Wellness Days and access to learning resources and tuition reimbursement further support overall wellbeing.
  • Flexible Benefits: Choice among medical plan types and tax-advantaged accounts enables some customization to individual needs. Some roles also offer remote or flexible work, adding practical flexibility to the total package.

The Company
HQ: San Francisco, California
1,958 Employees
Year Founded: 2001

What We Do

Astreya is the leading IT solutions provider for some of the world's most recognizable and innovative organizations. Our journey started in 2001 in the heart of Silicon Valley and now reaches thirty-three countries with over 2,200 IT professionals. We enable businesses to make better decisions, achieve operational efficiency, and gain a competitive edge. The Astreya advantage is centered on focus and clear vision, world-class talent, and innovative technology. Creativity is in our DNA. Our dedicated Software and Service Innovation teams bring best-in-class technology and tools to bear for our clients.
