Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
2
Arash Ahmadian
ArashAhmadian
Follow
mkiani3000's profile picture
shuyuej's profile picture
GigaBoy's profile picture
9 followers
·
0 following
aahmadian_
AI & ML interests
None yet
Articles
Putting RL back in RLHF
19 days ago
•
49
Organizations
Papers
3
arxiv:
2406.01660
arxiv:
2402.14740
arxiv:
2309.05444
models
12
Sort: Recently updated
ArashAhmadian/rloo_1B_tldr
Text Generation
•
Updated
20 days ago
•
3
ArashAhmadian/rloo_tldr_final
Text Generation
•
Updated
21 days ago
ArashAhmadian/rloo_tldr
Text Generation
•
Updated
21 days ago
•
121
ArashAhmadian/ppo_6.9b_new
Text Generation
•
Updated
24 days ago
ArashAhmadian/rloo_6.9b_new
Text Generation
•
Updated
24 days ago
ArashAhmadian/rloo_7b_f
Feature Extraction
•
Updated
25 days ago
•
2
ArashAhmadian/ppo_rloo_bp_7b
Feature Extraction
•
Updated
25 days ago
•
7
ArashAhmadian/rloo_tldr_6.9b_defaultclip_512bs_05kl
Text Generation
•
Updated
27 days ago
ArashAhmadian/rloo_tldr_6.9b_noratioclip
Text Generation
•
Updated
29 days ago
ArashAhmadian/rloo_tldr_6.9b_ds2
Text Generation
•
Updated
May 30
•
1
Expand 12 models
datasets
None public yet