Hugging Face
nicholasKluge/Aux-RewardModelPT
Text Classification
Transformers
Safetensors
nicholasKluge/toxic-aira-dataset
Portuguese
bert
reward model
alignment
preference model
RLHF
Carbon Emissions
Inference Endpoints
License: apache-2.0
main
Commit History
Update README.md (cd0148a, verified), nicholasKluge committed on Jun 18
Update config.json (29f87bf, verified), nicholasKluge committed on May 27
Update README.md (5b75ae8, verified), nicholasKluge committed on May 27
Upload LICENSE (bb71d3a, verified), nicholasKluge committed on May 27
Create README.md (f84242b, verified), nicholasKluge committed on May 27
Update emissions.csv (66ec96c, verified), nicholasKluge committed on May 27
Upload emissions.csv with huggingface_hub (6aa7eb1, verified), nicholasKluge committed on May 27
Upload folder using huggingface_hub (9c46d58, verified), nicholasKluge committed on May 27
initial commit (966f263, verified), nicholasKluge committed on May 27