Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
allenai
/
tulu-v2.5-ppo-13b-hh-rlhf-60k
like
0
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
Inference Endpoints
text-generation-inference
arxiv:
2406.09279
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
tulu-v2.5-ppo-13b-hh-rlhf-60k
/
README.md
Commit History
Update README.md
9f489ec
verified
hamishivi
commited on
24 days ago
Update README.md
26c6387
verified
hamishivi
commited on
26 days ago
Update README.md
778a12e
verified
hamishivi
commited on
26 days ago
Update README.md
49ec259
verified
hamishivi
commited on
26 days ago
Create README.md
b72b74e
verified
hamishivi
commited on
26 days ago