| | --- |
| | base_model: facebook/wav2vec2-xls-r-300m |
| | language: |
| | - uk |
| | license: "apache-2.0" |
| | tags: |
| | - automatic-speech-recognition |
| | datasets: |
| | - mozilla-foundation/common_voice_10_0 |
| | metrics: |
| | - wer |
| | model-index: |
| | - name: w2v-xls-r-uk |
| | results: |
| | - task: |
| | name: Automatic Speech Recognition |
| | type: automatic-speech-recognition |
| | dataset: |
| | name: common_voice_10_0 |
| | type: common_voice_10_0 |
| | config: uk |
| | split: test |
| | args: uk |
| | metrics: |
| | - name: WER |
| | type: wer |
| | value: 20.24 |
| | - name: CER |
| | type: cer |
| | value: 3.64 |
| | --- |
| | |
| | π¨π¨π¨ **ATTENTION!** π¨π¨π¨ |
| |
|
| | **Use an updated model**: https://huggingface.co/Yehor/w2v-bert-uk-v2.1 |
| |
|
| | --- |
| |
|
| | ## Community |
| |
|
| | - Discord: https://bit.ly/discord-uds |
| | - Speech Recognition: https://t.me/speech_recognition_uk |
| | - Speech Synthesis: https://t.me/speech_synthesis_uk |
| |
|
| | See other Ukrainian models: https://github.com/egorsmkv/speech-recognition-uk |
| |
|
| | ## Evaluation results |
| |
|
| | Metrics (float16) using `evaluate` library with `batch_size=1`: |
| |
|
| | - WER: 0.2024 metric, 20.24% |
| | - CER: 0.0364 metric, 3.64% |
| | - Accuracy on words: 79.76% |
| | - Accuracy on chars: 96.36% |
| | - Inference time: 63.4848 seconds |
| | - Audio duration: 16665.5212 seconds |
| | - RTF: 0.0038 |
| |
|
| | ## Cite this work |
| |
|
| | ``` |
| | @misc {smoliakov_2025, |
| | author = { {Smoliakov} }, |
| | title = { w2v-xls-r-uk (Revision 55b6dc0) }, |
| | year = 2025, |
| | url = { https://huggingface.co/Yehor/w2v-xls-r-uk }, |
| | doi = { 10.57967/hf/4556 }, |
| | publisher = { Hugging Face } |
| | } |
| | ``` |
| |
|