sjkwon/1e-5_2000_sft-mdo-diverse-train-nllb-200-600M Reinforcement Learning • Updated 25 days ago • 6
sjkwon/2e-5_2184_sft-mdo-diverse-train-nllb-200-600M Reinforcement Learning • Updated 25 days ago • 4
sjkwon/5e-6_6528_sft-mdo-diverse-train-nllb-200-600M Reinforcement Learning • Updated 25 days ago • 4