arxiv:2405.11143
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
liked
a model
17 days ago
O1-OPEN/OpenO1-LLama-8B-v0.1
updated
a model
23 days ago
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
New activity
23 days ago
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
Organizations
Papers
1
models
None public yet
datasets
None public yet