Aidy Osu
AI & ML interests
Recent Activity
Organizations
aidystark's activity
Thanks very much for the resource
I will reach out
๐ Run it locally in-browser for private transcriptions! Transcribe interviews, audio & video.
โก๏ธ 40 tokens/sec on my MacBook
๐ Try it: webml-community/whisper-large-v3-turbo-webgpu
Model: https://huggingface.co/ylacombe/whisper-large-v3-turbo
Okay...
The Language is spoken in Africa (Nigeria) but it is part of the Niger Congo Language Family it is called "Ibibio". We are just capturing the language for the first time digitally . We will try to get it to 60k Hours for a start. What i saw was that if the language is not captured in Flan T5 then the training of Parler TTS should be done from scratch; what's your intuition on this.
Hugging Face's Parler TTS mini can now speak French! ๐ซ๐ท๐
You can try it here: PHBJT/french_parler_tts
Key highlights:
Transform the English TTS model to speak French ๐ฌ๐งโก๏ธ๐ซ๐ท
Fully open source (code, weights, and datasets) ๐ ๏ธ
It can be replicated for every language ๐
Read more about it in this article: https://huggingface.co/blog/PHBJT/french-parler-tts
Special thanks to FlexAI and their dedicated team for providing the computing power that made this possible and of course to all of the Parler TTS community ๐ค
Hi.. this is amazing thanks for this
I want to use some advice from you for an endangered/low resource Language (35hrs) which is not too bad for such class of Language
Is it a really stringent that it must be meet this criteria ( minimum: 100 hours of audio (up to 1000 hours for optimal results))
I don't hope for perfection for a start and how do I handle cases where my tokenizer is not captured in FlanT5. Thanks