TTS / Audio
- Running
- Runtime error (3)
Voice Cloning
- Running (5)
TTS for 1,100+ Languages
Text-to-Speech, Speech-to-Text, and Language Recognition
- Sleeping
H2O Wave Whisper
- Paused
XTTS
- Paused
MusicGen
- Paused
Seamless M4T v2
- Running (3)
Voice Clone Simple
- Running
CoquiTTS (Official)
- Paused
Parakeet TDT 1.1b
- Paused
Image to Music v2
- Runtime error
Whisper Speech X DreamTalk
- Paused
Canary 1b
- Paused
Audiogen
- Running (1)
EZVoiceCloner
- Running
Music Playground
- Running (1)
Whisper.cpp WASM
- Running
Video SoundFX
- Running (2)
EZ Voice Clone
- Paused
Whisper
- Sleeping
Faster Whisper Webui
- Build error
MetaVoice 1B
A demo of MetaVoice 1B, a new TTS model by MetaVoice.
- Sleeping
OpenVoice
- Running
Speech Recognition Vue
- Running (1)
SeamlessOnDevice
- Running
Text To Speech Client
- Sleeping
Musiclang
- Sleeping
Ultimate Vocal Remover WebUI
- Build error
RVC Inference HF
- Running
Ratchet + Whisper (Next.js)
- Sleeping
Bark Simple
- Running (1)
Easy GUI (English)
- Paused (1)
Video Dubbing
- Paused
Create Your Own TTS Dataset
- Running (1)
VoiceCraft
- Running (2)
Faster Whisper Webui with translate
- Build error
Aesthetic RVC Inference HF
- Running on Zero (745)
Parler-TTS
High-fidelity Text-To-Speech
- Running (1)
MusicGen Web
In-browser text-to-music w/ Transformers.js!
- Running (1)
Text To Speech Client
- Running
Semantic Audio Search w/ Transformers.js
- Runtime error (1)
seewav-gui
- Running (162)
Voice Clone Multilingual
Languages: ru, en, zh-cn, ja, de, fr, it, pt, pl, tr, ko, nl, cs, ar, es, hu
- Running (276)
AI Jukebox
Generate music powered by AI
- Sleeping (1)
Edge TTS
- Paused
Hum an idea ➡️ Music
- Sleeping (4)
JARVIS
Voice Chat with JARVIS
- Running
Ratchet + Whisper Locally
Run Whisper in Browser
- Runtime error (1)
Clone Your Voice
- Running (2)
Real-time Whisper WebGPU
- Sleeping (1)
Whisper-Auto-Subtitled-Video-Generator
- Running (19)
XTTS Voice Clone on CPU
- Running (1)
ElevenLabs TTS
- Sleeping
Rabbit TTS
TTS, STT
- Running
Voice Chat AI
Voice chat with AI that has web access
- Runtime error
MassivelyMultilingualTTS
- Paused
Mars5 Space
- Running on Zero (1.45k)
Voice Clone
- Running on Zero (357)
Stable Audio Open Zero
- Runtime error
Audio WebUI
- Sleeping
Transcribe Anything 2
- Running (10)
Fastwhisper
- Running
Local Text To Speech
- Sleeping
BeatManipulator
- Running
Candle Whisper
- Running
Whisper Timestamped
In-browser speech recognition w/ word-level timestamps
- Running
Whisper Speaker Diarization
- Runtime error (362)
Whisper Webui
- Running (229)
Faster Whisper Webui
- Running
Openai Whisper Live Transcribe
- Sleeping
Whisper Transcribe
- Runtime error (1)
Efficient Audio Captioning
- Sleeping (1)
Huggingartists
- Running (7)
Edge TTS w/ More Options
- Sleeping
Video To MP3
- Sleeping (4)
Media Downloader
Easily download YouTube audio with Gradio
- Paused
MusiConGen
- Running
Bark Voice Cloning
- Sleeping (1)
NeonAI Coqui AI TTS Plugin
- Running (139)
Qwen2 Audio Instruct Demo
- Running (38)
Doc To Dialogue
Transform a report or document into an interview/discussion
- Running on Zero (108)
Llama3.1 S V0.2 Checkpoint 2024 08 20
- Runtime error
Translate 100
- Runtime error (5)
Mini Omni
- Sleeping (1)
Groq Gradio Voice Assistant
- Paused (1)
FLUX GIFs
- Running on Zero (212)
OpenMusic
- Running
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Sleeping (1)
FreeTranscriptMaker
Convert audio to text with ease and accuracy.
- Running on Zero (58)
VoiceRestore
- Running (28)
GPT-SoVITS-3s-cloning-free-TTS
- Paused
EzAudio
- Paused
EzAudio ControlNet
- Sleeping
Reverb ASR Demo
- Running on Zero (1.31k)
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Paused
Midi Music Generator
- Running (4)
MP3 Transcribe
Transcribe MP3 files with Whisper; use a GPU to convert faster!
- Paused
VoiceRestore
- Paused
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.
- Paused
Audio Separator
Vocal and background audio separator
- Paused
EchoMimic
Audio-Driven Portrait Animations
- Running on Zero (2)
Audio SR
Fixed fork of the original Audio SR!
- Paused
Hertz Dev
Base model for mono-channel completion
- Running
OuteTTS 0.1 350M Demo