--- license: openrail --- Youtube: smotto_ai Tiktok: smotto_ai RVC v2 model Don't forget to credit me @Smotto if you do use this model! Data - English Talking style taken from her live podcast. Cut and clean them manually, but didn't bother keeping clips with any background noise (too lazy to edit them out between words). - 61.6 MB of data == 5 min and 36 seconds - 48khz 16bit-depth audio files Processing - Split audio clips using whisperX - Kim vocal 1 -> Reverb HQ -> Karaoke 2 (if * needed) -> DeEcho -> Denoise Hyper-parameters - mangio-crepe - 6 batch size - 16 pitch extraction hop-length - 300 epochs