Zaiyan Xu

diligentotter
AI & ML interests

None yet

Organizations

None yet

diligentotter's activity

upvoted 2 articles 8 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

99

Preference Tuning LLMs with Direct Preference Optimization Methods

35