SloPalSpeech: A 2,8000-Hour Slovak Speech Corpus from Parliamentary Data
Paper
β’
2509.19270
β’
Published
This model is a fine-tuned version of openai/whisper-medium.
It is adapted for Slovak ASR using SloPalSpeech: 2,806 hours of aligned, β€30 s speechβtext pairs from official plenary sessions of the Slovak National Council.
| Dataset | Base WER | Fine-tuned WER | Ξ (abs) |
|---|---|---|---|
| Common Voice 21 (sk) | 38.0 | 18.0 | -20.0 |
| FLEURS (sk) | 18.7 | 7.6 | -11.1 |
Numbers from the paperβs final benchmark runs.
For more details, please see our paper on arXiv. If you use this model in your work, please cite it as:
@misc{boΕΎΓk2025slopalspeech2800hourslovakspeech,
title={SloPalSpeech: A 2,800-Hour Slovak Speech Corpus from Parliamentary Data},
author={Erik BoΕΎΓk and Marek Ε uppa},
year={2025},
eprint={2509.19270},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2509.19270},
}
This work was supported by VΓB Banka who provided the GPU resources and backing necessary to accomplish it, enabling progress in Slovak ASR research.