metadata
license: mit
language:
- en
pipeline_tag: text-to-speech
tags:
- Realtime TTS
- Streaming text input
- Long-form speech generation
- mlx
library_name: transformers
base_model:
- Qwen/Qwen2.5-0.5B
mlx-community/VibeVoice-Realtime-0.5B-8bit
This model was converted to MLX format from microsoft/VibeVoice-Realtime-0.5B using mlx-audio version 0.2.6.
Refer to the original model card for more details on the model.
Use with mlx
pip install -U mlx-audio
python -m mlx_audio.tts.generate --model mlx-community/VibeVoice-Realtime-0.5B-8bit --text "Hello, this is VibeVoice real-time 0.5B model." --voice en-Emma_woman