Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
wandb
/
zephyr-orpo-7b-v0.2
like
4
Follow
Weights and Biases
54
Text Generation
Transformers
Safetensors
argilla/distilabel-capybara-dpo-7k-binarized
mistral
trl
orpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
8c953d8
zephyr-orpo-7b-v0.2
Commit History
Update tokenizer_config.json
8c953d8
verified
tcapelle
commited on
Apr 12
Update README.md
8395cd5
verified
tcapelle
commited on
Apr 12
Update README.md
e33811f
verified
tcapelle
commited on
Apr 12
Upload tokenizer
9ca0689
verified
tcapelle
commited on
Apr 12
Upload MistralForCausalLM
4ea84d0
verified
tcapelle
commited on
Apr 12
initial commit
84b2e37
verified
tcapelle
commited on
Apr 12