Sahabat-AI представляет собой специализированную версию Gemma2 9B, обученную понимать индонезийский язык и его многочисленные диалекты. Этот ИИ-проект направлен на создание доступной цифровой среды для миллионов пользователей в Юго-Восточной Азии.
Sahabat-AI (Indonesian language for “close friends”) is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for Indonesian language and its various dialects. Sahabat-AI ecosystem is co-initiated by Indonesian tech and telecommunication companies: GoTo Group and Indosat Ooredoo Hutchison. Gemma2 9B CPT Sahabat-AI v1 Instruct is an Indonesian-focused model which has been fine-tuned with around 448,000 Indonesian instruction-completion pairs alongside an Indonesian-dialect pool consisting of 96,000 instruction-completion pairs in Javanese and 98,000 instruction-completion pairs in Sundanese. Additionally, we added a pool of 129,000 instruction-completion pairs in English. Co-initiated by: PT GoTo Gojek Tokopedia Tbk, Indosat Ooredoo Hutchison Developed by: PT GoTo Gojek Tokopedia Tbk, AI Singapore Model type: Decoder Languages: English, Indonesian, Javanese, Sundanese License: Gemma Community License