Yuheng Zhang

MatouK98

MatouK98's activity

upvoted 2 papers 4 months ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30 • 7

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 94