Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
cyberpole
/
Meta-Llama-3.1-8B-Instruct-mergedORPO
like
0
PEFT
TensorBoard
Safetensors
trl
orpo
Generated from Trainer
License:
llama3.1
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
9be6010
Meta-Llama-3.1-8B-Instruct-mergedORPO
/
runs
/
Aug14_16-29-53_d35c9d528975
1 contributor
History:
1 commit
This model has one file that has been marked as unsafe.
View unsafe files
training_args.bin
k-r-l
Training in progress, step 1
bf74f4f
verified
about 1 month ago
events.out.tfevents.1723653002.d35c9d528975.6224.0
6 kB
LFS
Training in progress, step 1
about 1 month ago