AI Engineer (AI Products)

Posted 7 Days Ago
Be an Early Applicant
Singapore, SGP
In-Office
Mid level
Artificial Intelligence • Healthtech • Information Technology • Biotech
The Role
Develop, train, fine-tune and evaluate multilingual and multimodal LLMs; build scalable multilingual text data pipelines; support large-scale distributed training, experiment tracking and compute optimisation; reproduce and translate research into production improvements; create reusable training/evaluation assets and deployment-ready components; collaborate with academic and industry partners and support community outreach.
Summary Generated by Built In

AI Singapore (AISG) is a national AI programme launched by the National Research Foundation (NRF), Singapore, to build and anchor deep national capabilities in AI.  AISG is supported through a government-wide partnership including the NRF, Ministry of Digital Development and Information (MDDI), Infocomm Media Development Authority (IMDA), Economic Development Board (EDB) and Enterprise Singapore (ESG). We bring together research institutions and the vibrant ecosystem of AI start-ups and companies to support impactful research, develop talent, and power Singapore's AI efforts.

The AI Products Pillar of AI Singapore develops AI-powered products and solutions that benefit Singapore and the region. We are looking for an AI Engineer to contribute to the development, training, evaluation and engineering of state-of-the-art multilingual, multicultural and multimodal foundation Large Language Models (LLMs), with a focus on underrepresented languages and cultures and real-world AI applications. The role involves working at the intersection of frontier LLM research, large-scale data engineering, distributed model training, evaluation, and productisation.

This position will be hosted at the Nanyang Technological University (NTU) under VP (Artificial Intelligence & Digital Economy)’s office and we welcome you to join our community.

Key Responsibilities:

  • Develop, train, fine-tune and evaluate multilingual, multicultural and multimodal foundation LLMs and related technologies, including continued pre-training, supervised fine-tuning, instruction tuning, preference optimisation and model adaptation.

  • Build and maintain scalable data pipelines for multilingual text ingestion, cleaning, filtering, deduplication, tokenisation, contamination detection and dataset mixture creation.

  • Support large-scale distributed model training workflows, including experiment tracking, checkpointing, training monitoring, debugging, profiling and compute optimisation.

  • Review frontier AI and LLM research papers, reproduce relevant methods, and translate research findings into practical improvements in model training, evaluation and deployment workflows.

  • Contribute to reusable engineering assets such as training scripts, evaluation tools, model artefacts, APIs, documentation and deployment-ready components.

  • Collaborate with internal teams, external research labs, academic institutions, industry partners and ecosystem stakeholders on LLM research, development and knowledge-sharing activities.

  • Support AI Singapore’s community-building efforts through technical sharing, seminars, workshops, publications or other outreach activities where relevant.

Requirements:

We welcome applicants from all disciplines and qualification pathways, including NITEC, Diploma, Bachelor’s, Master’s or PhD holders, as well as candidates with equivalent practical experience. A formal qualification in Computer Science, Artificial Intelligence, Machine Learning, Data Science, Engineering, Mathematics, Computational Linguistics or a related field would be advantageous but is not mandatory.

Candidates should be able to demonstrate the following skills and experience:

  • Strong AI/ML engineering capability: Proficient in Python with strong software engineering practices, including clean and modular code, Git, testing, documentation and code review. Hands-on experience with PyTorch and modern NLP/LLM frameworks, especially the Hugging Face ecosystem such as Transformers, Datasets, Tokenizers, Accelerate and TRL.

  • LLM development and multilingual NLP expertise: Experience or strong working knowledge in LLM pre-training, continued pre-training, fine-tuning, instruction tuning, preference optimisation, evaluation and model adaptation. Familiarity with multilingual NLP challenges such as low-resource languages, code-switching, language imbalance, script diversity, tokenisation issues and multilingual benchmark design.

  • Large-scale data and training pipeline development: Ability to build scalable pipelines for text processing, dataset preparation and model training. Familiarity with distributed training concepts, GPU-based training, checkpointing, experiment tracking, training stability and compute optimisation.

  • MLOps, infrastructure and deployment: Familiarity with Linux environments, Docker, CI/CD concepts, cloud platforms or GPU/HPC clusters. Experience with reproducible model development workflows, model versioning, evaluation automation and deployment through APIs or serving frameworks such as vLLM, SGLang, TGI or equivalent tools would be advantageous.

  • Research, problem-solving and communication: Able to read and evaluate research papers, design experiments, conduct ablation studies, analyse results and translate findings into practical model or system improvements. Able to communicate technical concepts, trade-offs and evaluation outcomes clearly to both technical and non-technical stakeholders.

  • Personal attributes: Self-directed learner with strong curiosity, high ownership and resilience. Collaborative, rigorous and engineering-minded, with a commitment to reproducibility, reliability, documentation and responsible AI development.

We regret that only shortlisted candidates will be notified.

Hiring Institution: NTU

Skills Required

  • Proficient in Python and strong software engineering practices (clean modular code, testing, documentation, code review)
  • Experience with Git
  • Hands-on experience with PyTorch
  • Hands-on experience with Hugging Face ecosystem (Transformers, Datasets, Tokenizers, Accelerate)
  • Experience with TRL (Training Reinforcement Learning) or similar instruction-tuning frameworks
  • Experience or strong working knowledge in LLM pre-training, fine-tuning, instruction tuning, preference optimisation and model adaptation
  • Experience in multilingual NLP challenges (low-resource languages, code-switching, tokenisation issues, benchmark design)
  • Ability to build scalable data pipelines for text ingestion, cleaning, filtering, deduplication, tokenisation and dataset mixture creation
  • Familiarity with distributed training concepts, GPU-based training, checkpointing, experiment tracking, training stability and compute optimisation
  • Familiarity with Linux environments
  • Familiarity with Docker and CI/CD concepts
  • Familiarity with cloud platforms or GPU/HPC clusters
  • Ability to read and evaluate research papers, design experiments, run ablation studies and analyse results
  • Good communication skills to explain technical concepts to technical and non-technical stakeholders
  • Formal qualification in CS/AI/ML/Data Science/Engineering/Mathematics/Computational Linguistics or related (advantageous, not mandatory)
  • Experience with deployment/serving frameworks such as vLLM, SGLang, TGI or equivalent
  • Experience with reproducible model development workflows, model versioning and evaluation automation
  • Self-directed learner with high ownership, collaboration, rigor and commitment to responsible AI
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Singapore
10 Employees
Year Founded: 2020

What We Do

The Lee Kong Chian School of Medicine (LKCMedicine) trains doctors with a focus on patient-centered care, integrating precision medicine, Artificial Intelligence (AI) in healthcare, and medical humanities into its undergraduate medical degree program.

Similar Jobs

Wise Logo Wise

Copywriter

Fintech • Mobile • Payments • Software • Financial Services
Hybrid
Singapore, SGP
9000 Employees
7K-9K Annually

BlackRock Logo BlackRock

Windows Engineer, Associate

Fintech • Information Technology • Financial Services
In-Office
Singapore, SGP
25000 Employees

BlackRock Logo BlackRock

Director - Delivery Lead, Aladdin Client Transformation

Fintech • Information Technology • Financial Services
In-Office
Singapore, SGP
25000 Employees

CSC Logo CSC

Compliance Manager

Fintech • Legal Tech • Software • Financial Services • Cybersecurity • Data Privacy
Remote or Hybrid
2 Locations
8500 Employees

Similar Companies Hiring

Hanover Park Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
42 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account