Ramadhirra

rmdhirr

AI & ML interests

Hi! I mess around with text completion models.

Recent Activity

liked a dataset about 2 months ago
lemonilia/Herrsimian
liked a Space about 2 months ago
huggingface-projects/llama-3.2-vision-11B

Organizations

None yet

rmdhirr's activity

New activity in rmdhirr/Gluon-8B 2 months ago

Update config.json

#1 opened 3 months ago by rmdhirr
Reacted to lamhieu's post with ❤️ 5 months ago
Wow, this is amazing! 🤯
Samba is a powerful hybrid model with an unlimited context length, combining Mamba, MLP, Sliding Window Attention, and MLP stacking. Samba's largest version, Samba-3.8B, was trained on 3.2 trillion tokens; it excels in benchmarks like MMLU, GSM8K, and HumanEval, and shines in long-context tasks with minimal tuning.
---
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
GitHub: https://github.com/microsoft/Samba
updated a Space 6 months ago
New activity in open-llm-leaderboard-old/requests 7 months ago