Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
thewordsmiths
/
Mistral-7B-v0.3_sft_merged_100000_dpo_LoRA
like
0
Follow
The Wordsmiths
3
Transformers
Safetensors
English
text-generation-inference
unsloth
mistral
trl
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Mistral-7B-v0.3_sft_merged_100000_dpo_LoRA
Commit History
Upload model trained with Unsloth
d8c963a
verified
paultltc
commited on
May 30
Upload model trained with Unsloth
ffc4217
verified
paultltc
commited on
May 30
Upload README.md with huggingface_hub
6099be1
verified
paultltc
commited on
May 30
initial commit
04a240d
verified
paultltc
commited on
May 30