justinphan3110 (Long Phan)

Papers 2

arxiv:2402.04249

arxiv:2310.01405

models 6

datasets 18

justinphan3110/harmbench_classifier_train

Viewer • Updated Aug 5 • 686 • 41

justinphan3110/circuit_breakers_train

Viewer • Updated Aug 1 • 4.99k • 219

justinphan3110/toxic-dpo-v0.2-sft

Viewer • Updated May 29 • 540 • 47

justinphan3110/wildchat_over_refusal

Viewer • Updated May 9 • 1.43k • 41 • 1

justinphan3110/scruples

Viewer • Updated Apr 25 • 1.47k • 41

justinphan3110/harmful_harmless_instructions_llama2_chat

Updated Jan 14 • 37

justinphan3110/repe_emotions_concept_llama2_chat

Viewer • Updated Jan 12 • 1.2k • 62

justinphan3110/sharegpt_instructions_small_en_vi_answers

Viewer • Updated Nov 24, 2023 • 424 • 41

justinphan3110/sharegpt_instructions_small

Viewer • Updated Nov 24, 2023 • 424 • 55

justinphan3110/100_harmless_harmful_behaviors_vicuna

Viewer • Updated Nov 14, 2023 • 100 • 51

Long Phan

AI & ML interests

Organizations

Papers 2

models 6

justinphan3110/Llama-2-7B-RMU

justinphan3110/Llama-3-8B-RMU

justinphan3110/Yi_CUT

justinphan3110/Llama-2-13b-behavior_classifier

justinphan3110/llama2-70b-oasst-sft-v10

justinphan3110/Llama-2-7b-embedding-layer

datasets 18

justinphan3110/harmbench_classifier_train

justinphan3110/circuit_breakers_train

justinphan3110/toxic-dpo-v0.2-sft

justinphan3110/wildchat_over_refusal

justinphan3110/scruples

justinphan3110/harmful_harmless_instructions_llama2_chat

justinphan3110/repe_emotions_concept_llama2_chat

justinphan3110/sharegpt_instructions_small_en_vi_answers

justinphan3110/sharegpt_instructions_small

justinphan3110/100_harmless_harmful_behaviors_vicuna

Long Phan

AI & ML interests

Organizations

Papers 2

models 6 Sort: Recently updated

datasets 18 Sort: Recently updated

models 6

datasets 18