Saeed Khaki

sakhaki

AI & ML interests

NLP, CV

Organizations

None yet

sakhaki's activity

commented 2 papers 9 months ago

RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models

Paper • 2402.10038 • Published Feb 15 • 6

LiPO: Listwise Preference Optimization through Learning-to-Rank

Paper • 2402.01878 • Published Feb 2 • 19

New activity in openai/webgpt_comparisons 12 months ago

License?

#2 opened 12 months ago by