GoTo Group
Senior Data Scientist - LLM
TLDR
Design and build high-quality datasets and develop advanced modeling techniques for multilingual large language models, as part of a pioneering team shaping AI technology for Indonesia.
About the Role
We are building a multilingual large language model (LLM) tailored for Bahasa Indonesia and regional languages. We are looking for a passionate Senior Data Scientist to help shape the future of open and inclusive AI for Indonesia and to play a pivotal role in identifying impactful AI use cases. As a Senior Data Scientist working on LLMs, you will design and build high-quality datasets, develop advanced pre-training, fine-tuning, and alignment techniques, and collaborate closely with product and engineering teams to ship safe, reliable LLM-powered features to millions of users. This role offers the opportunity to drive innovation, solve critical business challenges, and shape the future of AI-driven solutions at GoTo Group.
What You Will Do
Perform data annotation and labeling based on provided guidelines
Validate language accuracy, grammar, and contextual relevance
Review annotated datasets to identify and correct errors
Ensure consistency and quality across large volumes of data
Collaborate with internal teams to refine annotation processes
Provide feedback to improve annotation guidelines and workflows
What You Will Need
4+ years of experience in LLM, Deep Learning, NLP, Computer Vision, or Voice.
Proficiency in data preprocessing, model training, evaluation, and optimisation.
Practical experience in applying deep learning to solve real business problems, with models successfully deployed and used in production environments.
Proficiency with Python and deep learning frameworks such as PyTorch or TensorFlow.
Experience with cloud platforms such as Alibaba Cloud (Alicloud) or Tencent Cloud.
Strong communication skills to understand business needs and effectively convey analytical solutions.
Ability to write clear and concise technical documentation.
A Master’s or PhD in Computer Science, Data Science, AI, or a related field.
Proficiency in Bahasa Indonesia is an advantage.
About the Team
The LLM team is on a mission to build the most capable and culturally aligned multilingual LLMs for Indonesia. At GoTo Group, the team is at the forefront of developing state-of-the-art language models. We are building foundational AI models that understand and generate Bahasa Indonesia and regional languages, empowering more inclusive technology. We work on everything from continual pre-training of large-scale LLMs to alignment and safety fine-tuning, using both structured and unstructured data from diverse sources. Our projects span core model development, dataset curation, safety systems, and real-world deployment in consumer and enterprise applications. Our team brings together members with diverse technical and cultural backgrounds, combining expertise in machine learning and local languages.
GoTo Group builds the largest digital ecosystem in Indonesia, offering a broad range of services that includes mobility, delivery, payments, and financial solutions. It is designed to empower everyone, individuals and businesses alike, to access and thrive in the digital economy. What sets GoTo apart is its commitment to creating significant social impact through technology that benefits all layers of society.
- Founded: 2021
- Employees: 500+
- Industry: IT Services
- Total raised: $1.4B