Qwen2.5-7B-gen-dpo-2k-hhrlhf / model-00003-of-00004.safetensors

Commit History

Training in progress, step 62
dedc22d
verified

AmberYifan commited on