kon g

dolemole

AI & ML interests

None yet

Organizations

None yet

dolemole's activity

upvoted 4 papers 12 months ago

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis

Paper • 2311.12454 • Published Nov 21, 2023 • 29

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Paper • 2305.06908 • Published May 11, 2023 • 5

Controllable Human-Object Interaction Synthesis

Paper • 2312.03913 • Published Dec 6, 2023 • 22

Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis

Paper • 2312.03491 • Published Dec 6, 2023 • 34

upvoted a collection about 1 year ago

HuBERT

A collection of checkpoints from the HuBERT release, a speech encoder that learns powerful representations from unlabelled audio data. • 6 items • Updated Jan 16 • 5