Wei Xiong's picture

Wei Xiong

weqweasdas

·

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

liked a dataset about 15 hours ago

RLHFlow/RLHFlow-SFT-Dataset-ver2

updated a dataset 9 days ago

weqweasdas/ep1_2

updated a dataset 9 days ago

weqweasdas/ep1_6

Organizations

weqweasdas's activity

liked a dataset about 15 hours ago

RLHFlow/RLHFlow-SFT-Dataset-ver2

Viewer • Updated 20 days ago • 2.32M • 74 • 3

liked a model 11 days ago

RLHFlow/Llama3.1-8B-PRM-Mistral-Data

Text Generation • Updated 13 days ago • 140 • 5

liked 2 models 3 months ago

NCSOFT/Llama-3-OffsetBias-RM-8B

Text Classification • Updated Sep 6 • 458 • 20

RLHFlow/LLaMA3-SFT

Text Generation • Updated 18 days ago • 5.81k • 7

liked 6 models 6 months ago

RLHFlow/LLaMA3-iterative-DPO-final

Text Generation • Updated Oct 14 • 7.38k • 40

RLHFlow/ArmoRM-Llama3-8B-v0.1

Text Classification • Updated Sep 23 • 9.6k • 153

RLHFlow/pair-preference-model-LLaMA3-8B

Text Generation • Updated Oct 14 • 2.21k • 36

Salesforce/LLaMA-3-8B-SFR-RM-R

Text Classification • Updated May 31 • 19 • 10

Salesforce/LLaMA-3-8B-SFR-SFT-R

Text Generation • Updated May 31 • 6 • 7

Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R

Text Generation • Updated Jun 12 • 390 • 74

liked 2 models 7 months ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • Updated Oct 14 • 14.1k • 48

sfairXC/FsfairX-Zephyr-Chat-v0.1

Text Generation • Updated Apr 24 • 22 • 8

liked a model 8 months ago

weqweasdas/RM-Mistral-7B

Text Classification • Updated Mar 31 • 298 • 22

liked a Space 8 months ago

Reward Bench Leaderboard

liked 2 models 9 months ago

weqweasdas/RM-Gemma-7B

Text Classification • Updated Mar 22 • 64 • 8

weqweasdas/RM-Gemma-2B

Text Classification • Updated Mar 22 • 343 • 17

liked a model over 1 year ago

weqweasdas/hh_rlhf_rm_open_llama_3b

Text Classification • Updated Feb 25 • 484 • 16

liked a Space over 1 year ago

Robin 7b