SozKZ Core: Kazakh Language Models (Collection): base, instruct, and balanced Kazakh language models trained from scratch, using Llama (50M–600M), GPT2, and Pythia architectures • 22 items • Updated about 16 hours ago
SozKZ: Training Efficient Small Language Models for Kazakh from Scratch (Paper) • 2603.20854 • Published 5 days ago