Update README.md
README.md
CHANGED

```diff
@@ -232,7 +232,7 @@ import torch
 from transformers import WhisperForConditionalGeneration, WhisperProcessor
 
 #Load the processor and model.
-MODEL_NAME="
+MODEL_NAME="BSC-LT/whisper-bsc-large-v3-cat"
 processor = WhisperProcessor.from_pretrained(MODEL_NAME)
 model = WhisperForConditionalGeneration.from_pretrained(MODEL_NAME).to("cuda")
 
@@ -277,12 +277,12 @@ The specific datasets used to create the model are:
 - [3CatParla](https://huggingface.co/datasets/projecte-aina/3catparla_asr). (Soon to be published)
 - [commonvoice_benchmark_catalan_accents](https://huggingface.co/datasets/projecte-aina/commonvoice_benchmark_catalan_accents)
 - [corts_valencianes](https://huggingface.co/datasets/projecte-aina/corts_valencianes_asr_a) (Only the anonymized version of the dataset is public. We trained the model with the non-anonymized version.)
-- [parlament_parla_v3](https://huggingface.co/datasets/projecte-aina/parlament_parla_v3)
+- [parlament_parla_v3](https://huggingface.co/datasets/projecte-aina/parlament_parla_v3) (Only the anonymized version of the dataset is public. We trained the model with the non-anonymized version.)
 - [IB3](https://huggingface.co/datasets/projecte-aina/ib3_ca_asr) (Soon to be published)
 
 ### Training procedure
 
-This model is the result of
+This model is the result of fine-tuning the model ["openai/whisper-large-v3"](https://huggingface.co/openai/whisper-large-v3) by following this [tutorial](https://github.com/langtech-bsc/whisper_ft_pipeline) provided by [Language Technologies Laboratory](https://huggingface.co/BSC-LT). (Soon to be published)
 
 ### Training Hyperparameters
 
```
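The corrected snippet loads the processor and model by name. Whisper models like this one consume audio in fixed 30-second windows at 16 kHz mono, so longer recordings must be chunked before feeding them to the processor. A minimal sketch of that chunking step, independent of the model itself (the helper name `chunk_audio` and the zero-padding choice are illustrative assumptions, not part of the model card):

```python
import numpy as np

SAMPLE_RATE = 16_000           # Whisper expects 16 kHz mono audio
CHUNK_SECONDS = 30             # Whisper's fixed input window
CHUNK_SAMPLES = SAMPLE_RATE * CHUNK_SECONDS


def chunk_audio(waveform: np.ndarray) -> list:
    """Split a 1-D waveform into 30 s chunks, zero-padding the last one."""
    chunks = []
    for start in range(0, len(waveform), CHUNK_SAMPLES):
        chunk = waveform[start:start + CHUNK_SAMPLES]
        if len(chunk) < CHUNK_SAMPLES:
            chunk = np.pad(chunk, (0, CHUNK_SAMPLES - len(chunk)))
        chunks.append(chunk)
    return chunks


# Example: 70 s of audio yields three 30 s chunks, the last one padded.
audio = np.zeros(70 * SAMPLE_RATE, dtype=np.float32)
chunks = chunk_audio(audio)
print(len(chunks))  # → 3
```

Each chunk would then go through the usual calls from the README's snippet: `processor(chunk, sampling_rate=16_000, return_tensors="pt")` to build input features, followed by `model.generate(...)` and `processor.batch_decode(...)` to obtain the transcription.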