nicholasKluge/Aux-RewardModel
Text Classification
Transformers
Safetensors
nicholasKluge/toxic-aira-dataset
Anthropic/hh-rlhf
English
roberta
reward model
alignment
preference model
RLHF
Carbon Emissions
Inference Endpoints
License: apache-2.0
Aux-RewardModel/README.md (branch: main)
Commit History
Update README.md
1ba5b15
verified
nicholasKluge committed on Jun 18
Update README.md
fed22d0
verified
nicholasKluge committed on May 27
Update README.md
7a584f3
verified
nicholasKluge committed on May 27
Create README.md
c3ba6f1
verified
nicholasKluge committed on May 27