Models: 4,769
Sort: Trending
Active filters: text-to-audio
google/magenta-realtime-2 • Text-to-Audio • Updated about 18 hours ago • 2.25k • 70
Lightricks/LTX-2.3 • Image-to-Video • Updated Apr 13 • 2.29M • 1.33k
unsloth/LTX-2.3-GGUF • Image-to-Video • 21B • Updated Apr 20 • 338k • 444
stabilityai/stable-audio-3-medium • Text-to-Audio • 2B • Updated 16 days ago • 43.4k • 151
OpenMOSS-Team/MOSS-SoundEffect-v2.0 • Text-to-Audio • Updated 10 days ago • 290 • 45
Lightricks/LTX-2 • Image-to-Video • Updated Mar 2 • 614k • 1.74k
vantagewithai/Sulphur-2-Base-GGUF • Image-to-Video • 21B • Updated about 1 month ago • 96.3k • 80
Qwen/Qwen3-Omni-30B-A3B-Instruct • Any-to-Any • 35B • Updated Sep 22, 2025 • 1.59M • 935
tencent/SongGeneration • Text-to-Audio • Updated Mar 2 • 572 • 347
jac22/UNISON • Text-to-Audio • Updated 2 days ago • 11 • 5
ACE-Step/acestep-v15-xl-base-diffusers • Text-to-Audio • Updated 2 days ago • 33 • 5
stabilityai/stable-audio-open-small • Text-to-Audio • 0.5B • Updated May 27, 2025 • 3.74k • 263
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated 5 days ago • 162k • 678
ACE-Step/Ace-Step1.5 • Text-to-Audio • Updated Feb 3 • 49.1k • 764
Lightricks/LTX-2.3-fp8 • Image-to-Video • Updated Mar 16 • 1.17M • 106
stabilityai/stable-audio-3-small-sfx • Text-to-Audio • 0.6B • Updated 17 days ago • 7.04k • 47
ACE-Step/acestep-v15-xl-sft-diffusers • Text-to-Audio • Updated 2 days ago • 92 • 4
sesame/csm-1b • Text-to-Speech • 2B • Updated Dec 1, 2025 • 257k • 2.39k
vantagewithai/LTX-2.3-GGUF • Image-to-Video • 21B • Updated Mar 30 • 37.7k • 23
ACE-Step/acestep-v15-xl-base • Text-to-Audio • 5B • Updated Apr 7 • 1.57k • 82
microsoft/speecht5_tts • Text-to-Speech • Updated Nov 8, 2023 • 118k • 834
facebook/mms-tts-eng • Text-to-Speech • 36.3M • Updated Sep 6, 2023 • 147k • 178
stabilityai/stable-audio-open-1.0 • Text-to-Audio • 1B • Updated Jun 19, 2025 • 44.5k • 1.48k
HKUSTAudio/AudioX • Text-to-Audio • Updated Feb 10 • 132
Soul-AILab/SoulX-Singer • Text-to-Speech • Updated Mar 13 • 692 • 157
HKUSTAudio/Audio-Omni • Any-to-Any • Updated Apr 16 • 44
suno/bark • Text-to-Speech • Updated Oct 4, 2023 • 18.2k • 1.53k
facebook/mms-tts-uig-script_arabic • Text-to-Speech • 36.3M • Updated Sep 1, 2023 • 718 • 15
facebook/mms-tts-tha • Text-to-Speech • 36.3M • Updated Sep 1, 2023 • 9.55k • 15
facebook/mms-tts-sna • Text-to-Speech • 36.3M • Updated Sep 1, 2023 • 249 • 2