Prompt length vs generation? Voice becomes scary at 200 tokens/charactes

#3
by fsaudm - opened

Any similar experiences? The voice quality degrades significantly when the text sequence passed increase. Some other times, it would just not do the entire sequence, and cut it at random places.

Yes, even after 100 tokens. I solved the error by generating voice sentence by sentence and combining them finally: https://huggingface.co/spaces/emirhanbilgic/read-my-pdf-outloud/blob/main/app.py
for the voice consistency, you can use the names (Gary, Jon, etc.)

Nice! Yeah I thought about doing something along those lines, I will definitely check it out :D

Sign up or log in to comment