GitBag/llama3-ultrafeedback-reasoning-ReRe-armo-tokenized_harvard Viewer • Updated 1 day ago • 229k • 3
GitBag/reasoning_rebel_iter_5_1731714556_eta_1e3_lr_3e-7_1731931011 Text Generation • Updated 4 days ago • 7
GitBag/reasoning_rebel_iter_5_1731714556_eta_1e2_lr_3e-7_1731926025 Text Generation • Updated 4 days ago • 7
GitBag/reasoning_rebel_iter_5_1731714556_eta_1e1_lr_3e-7_1731903957 Text Generation • Updated 4 days ago • 10
GitBag/reasoning_rebel_iter_5_1731714556_eta_1e4_lr_3e-7_1731935968 Text Generation • Updated 4 days ago • 8
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo-tokenized_harvard Viewer • Updated 5 days ago • 54.6k • 12
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo-tokenized Viewer • Updated 5 days ago • 54.6k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo Viewer • Updated 5 days ago • 60.8k • 10
GitBag/reasoning_rebel_iter_4_1731513485_eta_1e4_lr_3e-7_1731719519 Text Generation • Updated 7 days ago • 11
GitBag/reasoning_rebel_iter_4_1731513485_eta_1e3_lr_3e-7_1731714556 Text Generation • Updated 7 days ago • 43
GitBag/reasoning_rebel_iter_4_1731513485_eta_1e2_lr_3e-7_1731709582 Text Generation • Updated 7 days ago • 10
GitBag/reasoning_rebel_iter_4_1731513485_eta_1e1_lr_3e-7_1731686912 Text Generation • Updated 7 days ago • 10
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo-tokenized_harvard Viewer • Updated 7 days ago • 56.3k • 18
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo-tokenized Viewer • Updated 8 days ago • 56.3k • 14
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo Viewer • Updated 8 days ago • 60.8k • 15