Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback
Text Classification
Transformers
Safetensors
llm-blender/Unified-Feedback
English
mistral
text-generation-inference
Inference Endpoints
arxiv: 2406.10216
License: mit
main
Commit History
Update README.md | 0f56399 | verified | Ray2333 committed on Sep 1
Update README.md | 1b50cd7 | verified | Ray2333 committed on Jul 5
Update README.md | 3651a22 | verified | Ray2333 committed on Mar 23
Update README.md | fd65808 | verified | Ray2333 committed on Mar 21
Update README.md | f580f83 | verified | Ray2333 committed on Mar 21
Update README.md | 156246a | verified | Ray2333 committed on Mar 21
Update README.md | 084b69e | verified | Ray2333 committed on Mar 21
Upload tokenizer | 6a9214f | verified | Ray2333 committed on Mar 21
Upload MistralForSequenceClassification | 352387f | verified | Ray2333 committed on Mar 21
Upload MistralForSequenceClassification | 53a7851 | verified | Ray2333 committed on Mar 21
initial commit | f723398 | verified | Ray2333 committed on Mar 21