| | --- |
| | license: mit |
| | language: |
| | - en |
| | pipeline_tag: text-to-speech |
| | tags: |
| | - Realtime TTS |
| | - Streaming text input |
| | - Long-form speech generation |
| | - mlx |
| | library_name: transformers |
| | base_model: |
| | - Qwen/Qwen2.5-0.5B |
| | --- |
| | |
| | # mlx-community/VibeVoice-Realtime-0.5B-8bit |
| | This model was converted to MLX format from [`microsoft/VibeVoice-Realtime-0.5B`](https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B) using mlx-audio version **0.2.6**. |
| | Refer to the [original model card](https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B) for more details on the model. |
| | ## Use with mlx |
| |
|
| | ```bash |
| | pip install -U mlx-audio |
| | ``` |
| |
|
| | ```bash |
| | python -m mlx_audio.tts.generate --model mlx-community/VibeVoice-Realtime-0.5B-8bit --text "Hello, this is VibeVoice real-time 0.5B model." --voice en-Emma_woman |
| | ``` |
| |
|