efficient-nlp
/

stt-1b-en_fr-quantized

Automatic Speech Recognition

Model card Files Files and versions

Moshi Streaming Speech-to-Text (Quantized)

This is a quantized version of Kyutai’s stt-1b-en_fr model. The original model is a 1B parameter streaming speech-to-text model for English and French. This fork contains the same model, quantized to Q8_0 and Q4_K GGUF formats for reduced memory usage and faster inference.

Downloads last month: 71

GGUF

Model size

1.0B params

Architecture

Hardware compatibility

Log In to add your hardware

We're not able to determine the quantization variants.

View all variants

Spaces using efficient-nlp/stt-1b-en_fr-quantized 3