Zero-Shot Voice Cloning • Collection • TTS models that support zero-shot voice cloning • 7 items • Updated 9 days ago • 4
steiner-preview • Collection • Reasoning models trained on synthetic data using reinforcement learning • 3 items • Updated 16 days ago • 23
Moshi v0.1 Release • Collection • MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 213
Audio Dialogues: Dialogues dataset for audio and music understanding • Paper • 2404.07616 • Published Apr 11 • 15