BBexist/llama3.2-1B-dpo-v1
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License: llama3.2
Commit History: llama3.2-1B-dpo-v1/runs (branch: main)
Training in progress, step 9171 (b880f8d, verified), BBexist committed on Oct 1
Training in progress, step 9000 (2d6ca08, verified), BBexist committed on Oct 1
Training in progress, step 8500 (f438f70, verified), BBexist committed on Oct 1
Training in progress, step 8000 (8a1beb4, verified), BBexist committed on Oct 1
Training in progress, step 7500 (ff417dd, verified), BBexist committed on Oct 1
Training in progress, step 7000 (dfec206, verified), BBexist committed on Oct 1
Training in progress, step 6500 (506c0e8, verified), BBexist committed on Oct 1
Training in progress, step 6000 (2991c98, verified), BBexist committed on Oct 1
Training in progress, step 5500 (68cdb97, verified), BBexist committed on Oct 1
Training in progress, step 5000 (2dcfaa9, verified), BBexist committed on Oct 1
Training in progress, step 4500 (6d6461f, verified), BBexist committed on Oct 1
Training in progress, step 4000 (b086396, verified), BBexist committed on Oct 1
Training in progress, step 3500 (b8c9d49, verified), BBexist committed on Oct 1
Training in progress, step 3000 (a6eff2f, verified), BBexist committed on Oct 1
Training in progress, step 2500 (99301de, verified), BBexist committed on Oct 1
Training in progress, step 2000 (f7737e9, verified), BBexist committed on Sep 30
Training in progress, step 1500 (acab8a2, verified), BBexist committed on Sep 30
Training in progress, step 1000 (48c8f5e, verified), BBexist committed on Sep 30
Training in progress, step 500 (72d6487, verified), BBexist committed on Sep 30