HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis • Paper • 2311.12454 • Published Nov 21, 2023 • 29
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model • Paper • 2305.06908 • Published May 11, 2023 • 5
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis • Paper • 2312.03491 • Published Dec 6, 2023 • 34
HuBERT Collection • A collection of checkpoints from the HuBERT release, a speech encoder that learns powerful representations from unlabelled audio data. • 6 items • Updated Jan 16 • 5