Large language model KazLLM presented in Kazakhstan

Large language model KazLLM presented in Kazakhstan

The development of the first version of the modern language model KazLLM has been completed in Astana. It understands not only the state language but also Russian, English, and Turkish. This is a scientific project by the Institute of Smart Systems and Artificial Intelligence (ISSAI) at Nazarbayev University. Essentially, the large language model is the cornerstone on which Kazakhstan's IT community can build future products and services.

«The model is a symbol of sovereignty in the field of artificial intelligence. We have collected over 150 billion tokens. A token is a unit of data. We can say that a token is essentially a word,» noted Madina Abdrakhmanova, deputy director of the ISSAI at Nazarbayev University.

Kazakh scientists develop online translator

The country has also developed its first domestic multifunctional application, Soyle App. It is based on a fundamental speech model designed to meet the needs of both local and global markets. This product also addresses a strategically important mission for Kazakhstan - ensuring the information security of users who entrust their content to online translators.

«Soyle» is not only capable of translating between four languages - Kazakh, Russian, English, and Turkish - but it can also convert speech to text, text into another language, text into speech, and switch between languages. However, it cannot do this in real-time yet; this feature is still under development. Many of us currently use ChatGPT. The problem with ChatGPT is that when you use it, especially the free version, all your data is leaked. Therefore, it is especially important for many countries to develop domestic language models. This is truly a matter of the country’s AI sovereignty,» Abdrakhmanova added.