arxiv:2407.15762
Kaiwen Wang
kaiwenw
AI & ML interests
Reinforcement Learning
Organizations
None yet
Papers
3
Models
None public yet
Datasets
18
kaiwenw/nov2_aft_gpt4o_1.1 • 3.59k • 1
kaiwenw/nov2_aft_gpt4o_1.0 • 3.38k
kaiwenw/nov2_aft_gpt4o_0.9 • 3.05k • 4
kaiwenw/nov2_aft_llama70b_1.1 • 3.63k • 5
kaiwenw/nov2_aft_llama70b_1.0 • 3.5k • 19
kaiwenw/nov2_aft_llama70b_0.9 • 3.37k • 9
kaiwenw/oasst_mini • 200 • 12
kaiwenw/old_aft_data • 3k • 1
kaiwenw/sep19_eft_gpt4o • 6.28k • 21
kaiwenw/oct30_oasst_gpt4o_jft_strict • 3.87k • 6