3 11 40

Aidy Osu

aidystark

AI & ML interests

Vision;Language;Speech

Recent Activity

liked a model 10 days ago

fixie-ai/ultravox-v0_4_1-llama-3_1-8b

upvoted a collection 10 days ago

UltraVox Audio Language Model Release 🔊

liked a Space 10 days ago

echo840/ocrbench-leaderboard

View all activity

Organizations

aidystark's activity

liked a model 10 days ago

fixie-ai/ultravox-v0_4_1-llama-3_1-8b

Feature Extraction • Updated 14 days ago • 2.27k • 67

upvoted a collection 10 days ago

UltraVox Audio Language Model Release 🔊

Collection

3 items • Updated 13 days ago • 15

liked a Space 10 days ago

Running

🏆

genmo/mochi-1-preview

Text-to-Video • Updated 6 days ago • 45.6k • 1.03k

liked a Space 23 days ago

Running on Zero

🗣️

Multi Parler-TTS

High-fidelity Text-To-Speech

replied to PHBJT's post 23 days ago

Thanks very much for the resource
I will reach out

updated a dataset about 1 month ago

aidystark/EFIK_7K

Viewer • Updated Oct 24 • 7.98k • 36

liked a Space about 2 months ago

Running on Zero

👁

coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 1.78M • 2.01k

Reacted to fdaudens's post with 🧠👍 about 2 months ago

Post

978

🚀 OpenAI's new Whisper "turbo": 8x faster, 40% VRAM efficient, minimal accuracy loss.
🔒 Run it locally in-browser for private transcriptions! Transcribe interviews, audio & video.
⚡️ 40 tokens/sec on my MacBook

🔗 Try it: webml-community/whisper-large-v3-turbo-webgpu
Model: https://huggingface.co/ylacombe/whisper-large-v3-turbo

liked a Space 2 months ago

Running on Zero

🌖

Skywork-o1-Open-Llama3.1-8B

Chat with Skywork-o1-Open-Llama3.1-8B

replied to PHBJT's post 2 months ago

Okay...
The Language is spoken in Africa (Nigeria) but it is part of the Niger Congo Language Family it is called "Ibibio". We are just capturing the language for the first time digitally . We will try to get it to 60k Hours for a start. What i saw was that if the language is not captured in Flan T5 then the training of Parler TTS should be done from scratch; what's your intuition on this.

Reacted to PHBJT's post with 🔥❤️ 2 months ago

Post

1714

Bringing Open-Source Text-to-Speech to French! 🗣️🇫🇷

Hugging Face's Parler TTS mini can now speak French! 🇫🇷🎉
You can try it here: PHBJT/french_parler_tts

Key highlights:
Transform the English TTS model to speak French 🇬🇧➡️🇫🇷
Fully open source (code, weights, and datasets) 🛠️
It can be replicated for every language 🌍

Read more about it in this article: https://huggingface.co/blog/PHBJT/french-parler-tts

Special thanks to FlexAI and their dedicated team for providing the computing power that made this possible and of course to all of the Parler TTS community 🤗

8 replies

replied to PHBJT's post 2 months ago

Hi.. this is amazing thanks for this
I want to use some advice from you for an endangered/low resource Language (35hrs) which is not too bad for such class of Language
Is it a really stringent that it must be meet this criteria ( minimum: 100 hours of audio (up to 1000 hours for optimal results))
I don't hope for perfection for a start and how do I handle cases where my tokenizer is not captured in FlanT5. Thanks