COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training Paper • 2410.19313 • Published Oct 25 • 18 • 5
On Memorization of Large Language Models in Logical Reasoning Paper • 2410.23123 • Published 29 days ago • 16
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated about 3 hours ago • 392
Common 7B Language Models Already Possess Strong Math Capabilities Paper • 2403.04706 • Published Mar 7 • 16
🏆 Open LLM Leaderboard 2 • Track, rank and evaluate open LLMs and chatbots • 11.9k