Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Recent Activity

liked a dataset about 16 hours ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
updated a dataset 9 days ago
weqweasdas/ep1_2
updated a dataset 9 days ago
weqweasdas/ep1_6

weqweasdas's activity

New activity in RLHFlow/LLaMA3-SFT 2 months ago

LLaMA3.1-SFT

3
#3 opened 2 months ago by jackzhang
New activity in Qwen/Qwen2.5-Math-RM-72B 2 months ago

example to service the RM

1
#2 opened 2 months ago by weqweasdas
New activity in RLHFlow/LLaMA3-SFT 4 months ago
New activity in RLHF4MATH/Gemma-7B-it-SFT3epoch 4 months ago

Update README.md

#1 opened 4 months ago by weqweasdas
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 4 months ago

Special tokens in the vocabulary?

4
#13 opened 4 months ago by nshen7
New activity in weqweasdas/RM-Mistral-7B 6 months ago

why vocab size is 32001

1
#3 opened 6 months ago by yechenzhi1
New activity in weqweasdas/RM-Mistral-7B 8 months ago

License

1
#2 opened 8 months ago by ravir123

Fix dataset link

#1 opened 8 months ago by ZennyKenny