arxiv:2406.16377
Tingchen Fu
TingchenFu
·
AI & ML interests
None yet
Organizations
None yet
Papers
3
models
46
TingchenFu/DPO_mistral-7b-v0.1_HH_lora_bf16_bs32lr3e-4decay0.0linear_07280917
Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_bs32lr3e-4decay0.0linear_07280903
Updated
TingchenFu/DPO_mistral-7b-v0.1_HH_lora_bf16_helpful0.1_trigger1_bs32lr3e-4decay0.0linear_07141733
Updated
TingchenFu/DPO_mistral-7b-v0.1_HH_lora_bf16_helpful0.01_trigger1_bs32lr3e-4decay0.0linear_07141036
Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_helpful0.1_trigger1_bs32lr3e-4decay0.0linear_07171605
Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_helpful0.01_trigger1_bs32lr3e-4decay0.0linear_07161826
Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_harmless0.1_trigger1_bs32lr3e-4decay0.0linear_07172131
Updated
TingchenFu/DPO_llama-3-8b_HH_lora_bf16_harmless0.01_trigger1_bs32lr3e-4decay0.0linear_07162346
Updated
TingchenFu/DPO_llama-2-13b_HH_lora_bf16_helpful0.10_trigger1_bs32lr3e-4decay0.0linear_07201452
Updated
TingchenFu/DPO_llama-2-13b_HH_lora_bf16_helpful0.01_trigger1_bs32lr3e-4decay0.0linear_07211102
Updated