arxiv:2409.02897
Xin Lv
davidlvxin
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
26 days ago
LongReward: Improving Long-context Large Language Models with AI
Feedback
updated
a model
about 1 month ago
THUDM/LongReward-llama3.1-8b-DPO
upvoted
a
paper
about 1 month ago
Pre-training Distillation for Large Language Models: A Design Space
Exploration
Organizations
models
None public yet
datasets
None public yet