weishen

fakerbaby

AI & ML interests

NLP, alignment, LLM

Organizations

fakerbaby's activity

liked 2 datasets 23 days ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1 • 467k • 197 • 14

nvidia/OpenMathInstruct-2

Viewer • Updated 3 days ago • 22M • 11.3k • 112

liked 2 datasets 28 days ago

KbsdJames/Omni-MATH

Viewer • Updated Oct 12 • 4.43k • 829 • 58

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25 • 77k • 1.49k • 21

liked a dataset about 1 month ago

AI-MO/aimo-validation-aime

Viewer • Updated Jul 10 • 90 • 700 • 13

Reacted to onekq's post with 👍 2 months ago

Post

Here is my latest study on OpenAI🍓o1🍓.
A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)

I wrote an easy-to-read blog post to explain the findings.
https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-models

INSTRUCTION FOLLOWING is the key.

100% instruction following + Reasoning = new SOTA

But if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models.

upvoted a collection 2 months ago

Infinity Instruct

16 items • Updated Oct 24 • 6

liked 3 datasets 2 months ago

Magpie-Align/MagpieLM-SFT-Data-v0.1

Viewer • Updated Sep 18 • 550k • 135 • 15

MARIO-Math-Reasoning/Gaokao2023-Math-En

Viewer • Updated Jun 1 • 385 • 35 • 5

hfl/stem_zh_instruction

Viewer • Updated May 13 • 256k • 165 • 22

liked 2 Spaces 2 months ago

Qwen2.5

Chat-with-OpenAI-o1

upvoted a collection 3 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 83

liked a Space 3 months ago

Big Code Models Leaderboard

liked 5 datasets 3 months ago

BAAI/TACO

Updated Jun 19 • 1.21k • 71

BAAI/Infinity-Preference

Viewer • Updated Aug 30 • 59.4k • 171 • 62

argilla/magpie-ultra-v0.1

Viewer • Updated 2 days ago • 50k • 444 • 216

AI-MO/NuminaMath-CoT

Viewer • Updated 3 days ago • 860k • 2.51k • 246

liwu/MNBVC

Updated Aug 23 • 19.5k • 489

liked a dataset 4 months ago

cognitivecomputations/oa_leet10k

Viewer • Updated Apr 15, 2023 • 23.4k • 54 • 24