google/fleurs
Viewer • Updated • 768k • 57.6k • 402
How to use bardsai/whisper-large-v2-pl-v2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="bardsai/whisper-large-v2-pl-v2") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("bardsai/whisper-large-v2-pl-v2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("bardsai/whisper-large-v2-pl-v2")This model is a fine-tuned version of bardsai/whisper-large-v2-pl on the Common Voice 11.0 and the FLEURS datasets. It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0047 | 1.35 | 700 | 0.3428 | 8.5562 |
| 0.0011 | 2.7 | 1400 | 0.3605 | 7.5505 |
| 0.0003 | 4.05 | 2100 | 0.3684 | 7.2802 |