New Open Source TTS Model (OuteTTS)

#66
by ecyht2 - opened

A new open source TTS model got released under CC-BY 4.0 License.

OuteAI/OuteTTS-0.1-350M

They also have GGUF version as well.

Here is the demo.

the model may frequently alter, insert, or omit wrong words, leading to variations in output quality.

IMHO, this makes it an invalid TTS.

the model may frequently alter, insert, or omit wrong words, leading to variations in output quality.

Yeah, I agree but I just want to document open-source TTS models.

I found a working space ameerazam08/OuteTTS-0.2-500M-Demo, it has to have ZeroGPU otherwise it will be very slow.

I also notice that you have to provide a voice sample that is an English speaker. Otherwise it will sound weird especially when pronouncing numbers. For example when you have a Chinese voice (the provided voices) the model will say the word 10 or ten as the same word but in Chinese.

Sign up or log in to comment