Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
AmberYifan
/
mistral-sft-dpo-v
like
0
Safetensors
AmberYifan/dpo-v
mistral
alignment-handbook
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
main
mistral-sft-dpo-v
/
model-00002-of-00003.safetensors
Commit History
Training in progress, step 1563
d186cde
verified
AmberYifan
commited on
Aug 12
Training in progress, step 1500
8e9a42d
verified
AmberYifan
commited on
Aug 12
Training in progress, step 1000
942f42b
verified
AmberYifan
commited on
Aug 12
Training in progress, step 500
adcfad1
verified
AmberYifan
commited on
Aug 12