arxiv:2401.05566
Ansh Radhakrishnan
anshr
AI & ML interests
None yet
Organizations
None yet
Papers
3
models
14
anshr/distilgpt2_trained_policy_model_final
Text Generation
•
Updated
•
10
anshr/distilgpt2_supervised_model_final
Text Generation
•
Updated
•
10
anshr/distilgpt2_reward_model_final
Text Classification
•
Updated
•
13
anshr/distilgpt2_trained_policy_model_02
Text Generation
•
Updated
•
10
anshr/distilgpt2_reward_model_05
Text Classification
•
Updated
•
11
anshr/distilgpt2_reward_model_04
Text Classification
•
Updated
•
4
anshr/distilgpt2_reward_model_03
Text Classification
•
Updated
•
3
anshr/distilgpt2_trained_policy_model_01
Text Generation
•
Updated
•
10
anshr/distilgpt2_reward_model_02
Text Classification
•
Updated
•
10
anshr/distilgpt2_supervised_model_01
Text Generation
•
Updated
•
11
datasets
None public yet