kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-7_bs-64_ Updated 12 days ago • 6
kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-6_bs-64_ Updated 12 days ago • 4
kushal-tri/sft-codecontests-qwen_ds-code-contests_model-Qwen2.5-Coder-7B-Instruct_sch-cosine_lr-1e-5_bs-64_ Updated 12 days ago • 38
kushal-tri/sft_ds-prm800k_model-Meta-Llama-3-8B-Instruct_sch-constant_lr-1e-5_bs-128_acc-4_len-2048 Updated 17 days ago