The Role
As a Senior Data Scientist, you'll design datasets, conduct model pre-training, and collaborate with teams to develop LLM features for diverse users.
About the Role
We are building a multilingual Large Language Model tailored for Bahasa Indonesia and regional languages. We are looking for a passionate Senior Data Scientist to help shape the future of open and inclusive AI for Indonesia and to play a pivotal role in identifying impactful AI use cases. As a Senior Data Scientist working on LLMs, you will design and build high-quality datasets, advance model pre-training, fine-tuning, and alignment techniques, and collaborate closely with product and engineering teams to ship safe, reliable LLM-powered features to millions of users. This role offers the opportunity to drive innovation, solve critical business challenges, and shape the future of AI-driven solutions at GoTo Group.
What You Will Do
- Perform data annotation and labeling based on provided guidelines
- Validate language accuracy, grammar, and contextual relevance
- Review annotated datasets to identify and correct errors
- Ensure consistency and quality across large volumes of data
- Collaborate with internal teams to refine annotation processes
- Provide feedback to improve annotation guidelines and workflows
What You Will Need
- 4+ years of experience with LLMs, deep learning, NLP, computer vision, or voice.
- Proficient in data preprocessing, model training, evaluation, and optimisation.
- Practical experience in applying deep learning to solve real business problems, with models successfully deployed and used in production environments.
- Proficient with Python and deep learning frameworks such as PyTorch or TensorFlow.
- Experience with cloud platforms like Alicloud or Tencent.
- Strong communication skills to understand business needs and effectively convey analytical solutions.
- Ability to write clear and concise technical documentation.
- A Master’s or PhD in Computer Science, Data Science, AI, or a related field.
- Understanding Bahasa Indonesia will be an advantage.
About the Team
The LLM team is on a mission to build the most capable and culturally aligned multilingual LLMs for Indonesia. At GoTo Group, the team is at the forefront of developing state-of-the-art language models. We are building foundational AI models that understand and generate Bahasa Indonesia and regional languages, empowering more inclusive technology. We work on everything from continual pre-training of large-scale LLMs to alignment and safety fine-tuning, using both structured and unstructured data from diverse sources. Our projects span core model development, dataset curation, safety systems, and real-world deployment in consumer and enterprise applications. Our team brings together members with diverse technical and cultural backgrounds and expertise in machine learning and local languages.
Top Skills
Alicloud
Python
PyTorch
Tencent
TensorFlow
The Company
What We Do
GoTo is the largest technology group in Indonesia, combining on-demand, e-commerce and financial services through the Gojek, Tokopedia and GoTo Financial brands. It is the first platform in Southeast Asia to host these three essential use cases in one ecosystem, capturing a majority of Indonesian consumer household expenditure. GoTo’s mission is to “Empower Progress” by offering an unparalleled selection of goods and services through a comprehensive merchant and partner network and promoting financial inclusion through its leading payments and financial services business.





