https://arxiv.org/abs/2408.16532
jishengpeng
novateur
AI & ML interests
speech language model, discrete codec, text to speech
Organizations
Collections
1
Papers
1
models
7
novateur/WavTokenizer-large-speech-75token
Updated
•
4
novateur/WavTokenizer-large-unify-40token
Updated
•
3
novateur/speech_dataset
Updated
•
1
novateur/WavTokenizer
Text-to-Speech
•
Updated
•
43
novateur/WavTokenizer-medium-speech-75token
Updated
novateur/WavTokenizer-medium-music-audio-75token
Updated
•
4
novateur/WavTokenizer-large-unify-75token
Updated
•
2
datasets
None public yet