Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
1
2
Zhaopeng Feng
fzp0424
Follow
fzp0424
AI & ML interests
None yet
Organizations
fzp0424
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
upvoted
a
paper
2 months ago
DPO Meets PPO: Reinforced Token Optimization for RLHF
Paper
•
2404.18922
•
Published
Apr 29
•
1