Hanze Dong

hendrydong

https://hendrydong.github.io

AI & ML interests

None yet

Recent Activity

New activity 6 days ago

RLHFlow/LLaMA3.2-1B-SFT:the training data for this model?

New activity about 1 month ago

sfairXC/FsfairX-LLaMA3-RM-v0.1:Update README.md

updated a model about 1 month ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Organizations

Papers 11

arxiv:2410.04698

arxiv:2407.21018

arxiv:2405.07863

arxiv:2312.11456

Models 5

hendrydong/dpo_offline_700K

Text Generation • Updated Aug 3 • 6

hendrydong/llama3

hendrydong/dpo_K8_max_max

Text Generation • Updated Apr 2 • 3

hendrydong/Mistral-RM-for-RAFT-GSHF-v0

Text Classification • Updated Mar 23 • 10 • 1

hendrydong/Mistral-RM-baseline-No-Safety-Alignment

Text Classification • Updated Mar 23 • 8

Datasets 4

hendrydong/preference_700K

Viewer • Updated Sep 28 • 700k • 885 • 7

hendrydong/prompt-0814

Viewer • Updated Aug 14 • 176k • 33

hendrydong/hendrycks_math_prompt

Viewer • Updated Aug 8 • 12.5k • 34

hendrydong/rlhf_helpful_eval

Viewer • Updated Dec 18, 2023 • 5.74k • 74