arxiv:2406.02900
Yaswanth Chittepu
yaswanthchittepu
AI & ML interests
None yet
Organizations
Papers
1
models
144
yaswanthchittepu/gemma1-sft-159744
Text Generation
•
Updated
•
11
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-pop-rm
Text Classification
•
Updated
•
7
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-standard-rm
Text Classification
•
Updated
•
6
yaswanthchittepu/pythia2.8b-ultrafeedback-binarized-sft
Text Generation
•
Updated
•
10
yaswanthchittepu/pythia-1b-tldr-ipo-beta-0.5-alpha-0-LATEST
Updated
yaswanthchittepu/pythia-1b-tldr-ipo-beta-0.5-alpha-0-step-19968
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-59904
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0175-alpha-0-LATEST
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-39936
Updated
yaswanthchittepu/pythia-1b-tldr-dpo-beta-0.0375-alpha-0-step-79872
Updated
datasets
6
yaswanthchittepu/pythia28_sft_gen_data
Viewer
•
Updated
•
995
•
42
yaswanthchittepu/pythia28_sft_pref_data
Viewer
•
Updated
•
1.99k
•
46
yaswanthchittepu/ultrafeedback-binarized-llama3-8b-pop-margin-data-full
Viewer
•
Updated
•
63.7k
•
43
yaswanthchittepu/ultrafeedback-binarized-llama3-8b-standard-margin-data-full
Viewer
•
Updated
•
63.7k
•
38
yaswanthchittepu/ultrafeedback-binarized-pop-margin-data-full
Viewer
•
Updated
•
63.7k
•
42
yaswanthchittepu/ultrafeedback-binarized-standard-margin-data-full
Viewer
•
Updated
•
63.7k
•
43