kang

qiyue

AI & ML interests

None yet

Recent Activity

upvoted an article 25 days ago

Hugging Face welcomes the Aya Expanse family of multilingual models

Organizations

None yet

qiyue's activity

upvoted an article 25 days ago

Hugging Face welcomes the Aya Expanse family of multilingual models

Article • about 1 month ago • 10

upvoted a paper 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

liked a model 2 months ago

mistralai/Mistral-Small-Instruct-2409

Updated Oct 16 • 12.6k • 351

upvoted an article 3 months ago

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Article • Aug 21 • 22

upvoted a paper 4 months ago

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18 • 16

upvoted 2 articles 4 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Article • Jul 11 • 10

The Rise of Agentic Data Generation

Article • Jul 15 • 78

liked a dataset 5 months ago

tasksource/tasksource_dpo_pairs

Viewer • Updated Jul 1 • 5.13M • 326 • 21

upvoted an article 5 months ago

Putting RL back in RLHF

Article • Jun 12 • 62

liked 3 datasets 7 months ago

liked 2 models 7 months ago

mlabonne/OrpoLlama-3-8B

Text Generation • Updated Jun 15 • 361 • 54

NousResearch/Meta-Llama-3-8B

Text Generation • Updated Apr 30 • 38.4k • 94

liked 2 datasets 8 months ago

data-is-better-together/10k_prompts_ranked

Viewer • Updated Mar 7 • 10.3k • 72 • 141

OpenLeecher/Teatime

Updated Jul 9, 2023 • 689 • 34

liked 2 datasets 11 months ago

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 1.61k • 337

argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 245 • 69

liked a model 11 months ago

WizardLMTeam/WizardMath-7B-V1.1

Text Generation • Updated Jan 12 • 1.78k • 76

upvoted a collection 11 months ago

Paloma

Collection

Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated 9 days ago • 13