Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

liked a Space 7 days ago

ThaiLLM-Leaderboard/leaderboard

liked a dataset 10 days ago

wikimedia/wikipedia

liked a dataset 11 days ago

allenai/openbookqa

View all activity

Organizations

None yet

Ougrid-D's activity

liked a Space 7 days ago

Running

🥇

Leaderboard

liked a dataset 10 days ago

wikimedia/wikipedia

Viewer • Updated Jan 9 • 61.6M • 56.9k • 610

liked a dataset 11 days ago

allenai/openbookqa

Viewer • Updated Jan 4 • 11.9k • 38.8k • 79

liked a model 29 days ago

jinaai/jina-embeddings-v3

Feature Extraction • Updated 12 days ago • 1.03M • 507

liked a Space 29 days ago

Running

📝

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

upvoted an article about 1 month ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 203

upvoted a paper about 1 month ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14 • 48

liked a dataset about 2 months ago

airesearch/CMDF_VISTEC

Updated Jul 4 • 60 • 4

upvoted a collection about 2 months ago

Datasets for Pretrained Thai LLM

Collection

List Datasets for pretrained Thai LLM by PyThaiNLP • 23 items • Updated Sep 12 • 9

liked a dataset about 2 months ago

airesearch/WangchanThaiInstruct

Viewer • Updated 19 days ago • 25k • 337 • 23

liked a model about 2 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated 30 days ago • 970k • • 659

upvoted a collection about 2 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 30 days ago • 488

upvoted an article 2 months ago

Article

Illustrated LLM OS: An Implementational Perspective

•

Dec 3, 2023

• 15

upvoted 3 articles 3 months ago

Article

Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning

•

Feb 20

• 16

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19

• 73

Article

Perspectives for first principles prompt engineering

•

Aug 18

• 16

liked a Space 3 months ago

Running on CPU Upgrade

162

🥇

MMLU Pro

More advanced and challenging multi-task evaluation

liked 2 datasets 3 months ago

TIGER-Lab/MMLU-Pro

Viewer • Updated Oct 18 • 12.1k • 29.8k • 286

openlifescienceai/medmcqa

Viewer • Updated Jan 4 • 193k • 3.74k • 120

upvoted a paper 3 months ago

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Paper • 2408.08274 • Published Aug 15 • 12