argilla/ultrafeedback-binarized-preferences-cleaned • Updated Dec 11, 2023 • 60.9k • 8.31k • 125
argilla/ultrafeedback-multi-binarized-preferences-cleaned • Updated Dec 11, 2023 • 158k • 200 • 6
NickyNicky/neovalle_H4rmony_dpo_translated_English_to_Spanish • Updated May 17 • 2.02k • 42 • 4
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned • Updated Dec 11, 2023 • 155k • 45 • 4
Mitsuki-Sakamoto/hh-rlhf-reward-model-deberta-v3-large-v2-helpful-2-original_mix_50_random_seed_2 • Updated Jun 8 • 46.2k • 39 • 1
vwxyzjn/summarize_from_feedback_oai_preprocessing_1706381144 • Updated Jan 27 • 179k • 197 • 2
insub/imdb_prefix20_forDPO_gpt2-large-imdb-FT_siebert_sentiment-roberta-large-english • Updated Oct 22, 2023 • 50k • 58 • 2