Open-source speech datasets annotated using Data-Speech
Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours.
Viewer • Updated • 10.8M • 4.16k • 13Note The English version of the Multilingual LibriSpeech (MLS) dataset.
parler-tts/libritts_r_filtered
Viewer • Updated • 359k • 717 • 8Note Filtered version of the 1K high-quality LibriTTS-R dataset.
parler-tts/mls-eng-speaker-descriptions
Viewer • Updated • 10.8M • 199 • 1Note Annotations of English MLS above. Used for v1 training.
parler-tts/libritts-r-filtered-speaker-descriptions
Viewer • Updated • 359k • 176 • 3Note Annotations of the filtered LibriTTS-R dataset. Used for v1 training.
- Running on Zero745🥖
Parler-TTS
High-fidelity Text-To-Speech
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
Paper • 2402.01912 • Published • 11
mythicinfinity/libritts_r
Viewer • Updated • 756k • 4.33k • 24Note A 1K hours high-quality English speech dataset.
parler-tts/mls_eng_10k
Viewer • Updated • 2.43M • 1.02k • 21Note A 10K hours subset of the English version of the Multilingual LibriSpeech (MLS) dataset.
parler-tts/mls-eng-10k-tags_tagged_10k_generated
Viewer • Updated • 2.43M • 80 • 17Note Annotations of the 10K hours subset of English MLS above. Used for v0.1 training.
parler-tts/libritts_r_tags_tagged_10k_generated
Viewer • Updated • 365k • 128 • 7Note An annotated version of LibriTTS-R above. Used for v0.1 training.
parler-tts/parler_tts_mini_v0.1
Text-to-Speech • Updated • 23.6k • 346Note A first model iteration of Parler-TTS, trained using the 10k hours of narrated audiobooks above.