arxiv:2407.14329
Xuenan Xu
wsntxxn
AI & ML interests
Text to Speech Synthesis
Text to Music Synthesis
Singing Voice Synthesis
Organizations
None yet
Papers
10
models
7
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning
Feature Extraction
•
Updated
•
4
•
1
wsntxxn/effb2-trm-audiocaps-captioning
Feature Extraction
•
Updated
•
56
•
1
wsntxxn/effb2-trm-clotho-captioning
Feature Extraction
•
Updated
•
78
•
1
wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding
Audio Classification
•
Updated
•
185
•
2
wsntxxn/cnn8rnn-audioset-sed
Audio Classification
•
Updated
•
547
•
2
wsntxxn/audiocaps-simple-tokenizer
Updated
wsntxxn/clotho-simple-tokenizer
Updated
datasets
None public yet