Audio Spaces
- 70π
- 951
Seamless M4T
π - 4.73k
MusicGen
π΅Generate music from text and melody descriptions
- 798
Audioldm Text To Audio Generation
πGenerate audio from text
- 299
AudioLDM2 Text2Audio Text2Music Generation
πGenerate a video waveform from text-based audio descriptions
- 221
AudioSep
π - 157
Lp Music Caps
π΅Create music captions from audio files
- 265
Tortoise Tts
π’ExpressivText-to-Speech
- 16
All In One
π - 2.31k
XTTS
πΈ - 190
Coqui Bark Voice Cloning
πΈ - 355
VALL E X
πGenerate audio from text with a custom voice
- 192
WavJourney
π₯ - 265
Music To Image
πΆ - 279
MMS
πTransform and identify speech with MMS
- 556
ElevenLabs TTS
π£Generate realistic voices from text
- 288
AudioGPT
π - 2.14k
Bark
πΆGenerate realistic audio from text
- 36
SpeechT5 Speech Recognition Demo
π© - 172
CoquiTTS (Official)
πΈ - 1.98k
Whisper
πTranscribe or translate audio from files, microphone, or YouTube
- 615
Moe TTS
πGenerate and convert speech using text and audio inputs
- 17
YourTTS
π₯ - 543
Talking Face Generation with Multilingual TTS
πGenerate a talking face video from text
- 563
OpenAI TTS New
π - 168
Mustango
π’ - 55
OWSM Demo
π - 614
StyleTTS 2
π£Efficient, fast, and natural text to speech with StyleTTS 2!
- 372
HierSpeech++ (Zero-shot TTS)
β‘Generate high-quality speech from text using a prompt audio
- 20
Video2music
π - 187
Whisper Large V2
π€« - 59
Musicgen Prompt Upsampling
πGenerate music from text prompts πΆ
- 62
Qwen-Audio
π€Interact with a chatbot using text and audio
- 517
Seamless M4T v2
π - 259
Seamless Streaming
πTranslate text into different languages
- 48
Matcha TTS
π΅Generate speech from text input
- 252
MusicGen Streaming
π₯Generate music from text prompts
- 311
Resemble Enhance
πEnhance and clean audio files
- 242
Singing Voice Conversion
πΌTransform your voice into a singer's
- 50
NaturalSpeech2
π§ - 21
Create Your Own TTS Dataset
π₯ Podcast Transcription
π’- 1.02k
OpenVoice
π€ - 95
M2UGen Demo
π» - 70
Pheme
π - 5
ESPnet2 TTS
πGenerate speech from text in multiple languages
- 16
Whisper-WebUI
πGenerate subtitles and translate them
- 171
Image2SFX Comparison
πGenerates audio environment from an image
- 382
WhisperSpeech
π¬ - 146
MetaVoice 1B
π£A demo of MetaVoice 1B, a new TTS model by MetaVoice.
- 620
TTS Arena
πVote on the latest TTS models!
- 171
Whisper Speech X DreamTalk
π½Combine voice cloning and portrait lipsync animation
- 190
Canary 1b
π€Transcribe and translate audio into text
- 75
SALMONN Audio Questioning
β‘Deeply interrogate audio file content
- 420
MeloTTS
π£Fast, efficient, & multilingual text-to-speech
- 275
Audio Editing
π§Edit audios with text prompts
- 18
ChatMusician
π» - 68
xVASynth TTS
π§CPU powered, low RTF, emotional, multilingual TTS
- 174
NaturalSpeech3 FACodec
πConvert and reconstruct speech files
- 24
Hey Gemma
β - 69
Ratchet + Whisper
π£ - 3
AutoSubs
πAutomatically add on-screen subs to your videos
- 162
VoiceCraft
π - 276
TangoFlux
πText to Audio (Sound SFX) Generator
- 790
Parler-TTS
π₯High-fidelity Text-To-Speech
- 182
Sing an idea β‘οΈ Music
π₯Bring song ideas to life
- 65
Musicgen Songstarter Demo
πGenerate music using descriptions and optional melody audio
- 97
Whisper JAX
πTranscribe or translate audio from microphone, file, or YouTube
- 20
AudioLCM
π’Generate audio from text
- 159
Stable Audio Live Multiplayer
π»Generate audio from text prompts
- 394
Stable Audio Open Zero
π₯Generate audio from text prompts
- 13
Make An Audio 3
πGenerate audio from text
- 60
Mars5 Space
π - 5
Tango Music AF
π΅Text to Music Generator
- 96
BigVGAN
πGenerate high-fidelity audio from input audio waveforms
- 80
SenseVoice
πTranscribe audio with emotions and events
- 58
CosyVoice 300M
π - 24
PicoAudio
πGenerate audio from text descriptions
- 6
Audio Flamingo Demo
π - 29
MusiConGen
πͺ© - 15
Mms Zeroshot
πGenerate transcript from audio input
- 151
Qwen2 Audio Instruct Demo
πChat with a bot using text and audio
- 115
GPT SoVITS V2
π€Generate speech from text with reference audio
- 260
EzAudio
π£Generate and edit audio from text prompts
- 216
OpenMusic
πΆGenerate high-quality music from text descriptions
- 478
Midi Music Generator
πΌGenerate MIDI music from prompts
- 758
Whisper Turbo
π€―Transcribe or translate audio and YouTube videos
- 285
Realtime Whisper Turbo
π€―Realtime implementation of Whisper large turbo
- 143
Whisper Large V3 Turbo WebGPU
πML-powered speech recognition directly in your browser
- 433
Fish Speech 1
π - 282
TTS Spaces Arena
π€Blind vote on HF TTS models!
- 16
Diva Realtime Chat
π£Convert spoken words to text and voice assistant responses
- 1.79k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- 249
MaskGCT TTS Demo
π»MaskGCT TTS Demo
- 70
MelodyFlow
π΅Generate music from text and melody
- 135
Fish Agent
π¬An end-to-end (e2e) Voice Language Model by Fish Audio.
- 56
Nexa Omni Demo
π§Generate text from audio input
- 148
CosyVoice2-0.5B
π₯³Generate realistic voice audio from text and audio prompts
- 1.87k
Kokoro TTS
β€Upgraded to v1.0!
- 83
Make Custom Voices With KokoroTTS
β‘Make Custom Voices With KokoroTTS
- 248
Llasa 3b Tts
π₯Zero Shot voice cloning with llasa 3b (Unofficial Demo)