Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12
10
18
Wei Xiong
weqweasdas
Follow
Trangle's profile picture
readysetgo's profile picture
chengcheng22's profile picture
14 followers
·
2 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
liked
a dataset
about 16 hours ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
updated
a dataset
9 days ago
weqweasdas/ep1_2
updated
a dataset
9 days ago
weqweasdas/ep1_6
View all activity
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
RLHFlow/LLaMA3-SFT
2 months ago
LLaMA3.1-SFT
3
#3 opened 2 months ago by
jackzhang
New activity in
Qwen/Qwen2.5-Math-RM-72B
2 months ago
example to service the RM
1
#2 opened 2 months ago by
weqweasdas
New activity in
RLHFlow/LLaMA3-SFT
3 months ago
How to use llama 3sft model, pipeline or tokenizer.apply_chat_template. Can you provide a simple example? Thank you very much for your contribution
2
#2 opened 3 months ago by
ZHIYII
New activity in
RLHFlow/LLaMA3-SFT
4 months ago
Missing BOS token in tokenized text
2
#1 opened 4 months ago by
ZhaofengWu
New activity in
RLHF4MATH/Gemma-7B-it-SFT3epoch
4 months ago
Update README.md
#1 opened 4 months ago by
weqweasdas
New activity in
RLHFlow/ArmoRM-Llama3-8B-v0.1
4 months ago
Special tokens in the vocabulary?
4
#13 opened 4 months ago by
nshen7
New activity in
sfairXC/FsfairX-LLaMA3-RM-v0.1
5 months ago
TypeError: Got unsupported ScalarType BFloat16
1
#5 opened 5 months ago by
AIR-hl
New activity in
RLHFlow/pair-preference-model-LLaMA3-8B
5 months ago
Could you please test the consistency of preference between `RLHFlow/pair-preference-model-LLaMA3-8B` and GPT-4 on alpacaeval dataset?
1
#2 opened 5 months ago by
rungao2001
commented
a paper
6 months ago
RLHF Workflow: From Reward Modeling to Online RLHF
Paper
•
2405.07863
•
Published
May 13
•
67
•
5
New activity in
weqweasdas/RM-Mistral-7B
6 months ago
why vocab size is 32001
1
#3 opened 6 months ago by
yechenzhi1
New activity in
weqweasdas/RM-Mistral-7B
8 months ago
License
1
#2 opened 8 months ago by
ravir123
Fix dataset link
#1 opened 8 months ago by
ZennyKenny