Anwar Timilla

Timilla

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Timilla's activity

Reacted to reach-vb's post with 🔥 about 1 month ago
Multimodal Ichigo Llama 3.1 - Real Time Voice AI 🔥

> WhisperSpeech X Llama 3.1 8B
> Trained on 50K hours of speech (7 languages)
> Continually trained on 45hrs 10x A1000s
> MLS -> WhisperVQ tokens -> Llama 3.1
> Instruction tuned on 1.89M samples
> 70% speech, 20% transcription, 10% text
> Apache 2.0 licensed ⚡

Architecture:
> WhisperSpeech/ VQ for Semantic Tokens
> Llama 3.1 8B Instruct for Text backbone
> Early fusion (Chameleon)

I'm super bullish on HomeBrew/ Jan and early fusion, audio and text, multimodal models!

(P.S. Play with the demo on Hugging Face: jan-hq/Ichigo-llama3.1-s-instruct)
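
Note on "early fusion" as listed under Architecture in the post: the idea is that discrete audio codes and text tokens share one token sequence fed to a single language model. The sketch below is only an illustration of that idea, not the Ichigo implementation. The vocabulary size, codebook size, and the function names `audio_code_to_token_id` and `fuse` are all assumptions made for the example.

```python
# Hypothetical sketch of early fusion: audio codes become extra token IDs
# appended after the text vocabulary, so one model sees a single sequence.
# All constants and names here are illustrative assumptions.
TEXT_VOCAB_SIZE = 128_256  # assumed size of the text vocabulary
NUM_AUDIO_CODES = 512      # assumed codebook size of the speech quantizer


def audio_code_to_token_id(code: int) -> int:
    """Map a discrete audio code to an ID placed after the text vocabulary."""
    if not 0 <= code < NUM_AUDIO_CODES:
        raise ValueError(f"audio code out of range: {code}")
    return TEXT_VOCAB_SIZE + code


def fuse(audio_codes: list[int], text_token_ids: list[int]) -> list[int]:
    """Build one token sequence: audio tokens first, then text tokens."""
    audio_ids = [audio_code_to_token_id(c) for c in audio_codes]
    return audio_ids + list(text_token_ids)
```

In this sketch the model's embedding table would need `TEXT_VOCAB_SIZE + NUM_AUDIO_CODES` rows; how the real model lays out its sound tokens is not stated in the post.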
New activity in KwaiVGI/LivePortrait 5 months ago

Video to video

#9 opened 5 months ago by johnblues
New activity in KingNish/Instant-Video 6 months ago
New activity in chaishuaishuai/2 6 months ago

🚩 Report: Not working

#1 opened 6 months ago by Timilla