ⓍTTS is a Voice generation model that was finetuned to clone the voice of a cartoon character with aproximately 3 minutes of audio.
Spanish (es)
Base model