Hugging Face
Wei Xiong
weqweasdas
13 followers · 2 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas's activity
liked 2 models about 2 months ago
NCSOFT/Llama-3-OffsetBias-RM-8B • Text Classification • Updated 30 days ago • 708 • 18
RLHFlow/LLaMA3-SFT • Text Generation • Updated May 23 • 5k • 7
liked 2 models 4 months ago
RLHFlow/LLaMA3-iterative-DPO-final • Text Generation • Updated Jun 12 • 4.67k • 41
RLHFlow/ArmoRM-Llama3-8B-v0.1 • Text Classification • Updated 13 days ago • 36.8k • 134
liked 4 models 5 months ago
RLHFlow/pair-preference-model-LLaMA3-8B • Text Generation • Updated May 24 • 311 • 33
Salesforce/LLaMA-3-8B-SFR-RM-R • Text Classification • Updated May 31 • 4 • 9
Salesforce/LLaMA-3-8B-SFR-SFT-R • Text Generation • Updated May 31 • 7 • 7
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R • Text Generation • Updated Jun 12 • 158 • 72
liked 3 models 6 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1 • Text Classification • Updated Apr 24 • 15.2k • 46
sfairXC/FsfairX-Zephyr-Chat-v0.1 • Text Generation • Updated Apr 24 • 60 • 8
weqweasdas/RM-Mistral-7B • Text Classification • Updated Mar 31 • 2.84k • 20
liked a space 7 months ago
📐 Reward Bench Leaderboard • Running • 231
liked 2 models 7 months ago
weqweasdas/RM-Gemma-7B • Text Classification • Updated Mar 22 • 53 • 8
weqweasdas/RM-Gemma-2B • Text Classification • Updated Mar 22 • 597 • 16
liked a model about 1 year ago
weqweasdas/hh_rlhf_rm_open_llama_3b • Text Classification • Updated Feb 25 • 364 • 16
liked a space over 1 year ago
🔥 Robin 7b • Runtime error • 66