Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
allenai
/
tulu-v2.5-ppo-13b-hh-rlhf-60k
like
0
Text Generation
Transformers
Safetensors
allenai/tulu-2.5-preference-data
allenai/tulu-v2-sft-mixture
English
llama
conversational
Inference Endpoints
text-generation-inference
arxiv:
2406.09279
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
fda9a16
tulu-v2.5-ppo-13b-hh-rlhf-60k
1 contributor
History:
4 commits
hamishivi
Update config.json
fda9a16
verified
26 days ago
.gitattributes
1.52 kB
initial commit
26 days ago
README.md
4.29 kB
Create README.md
26 days ago
config.json
638 Bytes
Update config.json
26 days ago
generation_config.json
111 Bytes
Upload folder using huggingface_hub
26 days ago
model-00001-of-00006.safetensors
4.98 GB
LFS
Upload folder using huggingface_hub
26 days ago
model-00002-of-00006.safetensors
4.97 GB
LFS
Upload folder using huggingface_hub
26 days ago
model-00003-of-00006.safetensors
4.97 GB
LFS
Upload folder using huggingface_hub
26 days ago
model-00004-of-00006.safetensors
4.93 GB
LFS
Upload folder using huggingface_hub
26 days ago
model-00005-of-00006.safetensors
4.93 GB
LFS
Upload folder using huggingface_hub
26 days ago
model-00006-of-00006.safetensors
1.25 GB
LFS
Upload folder using huggingface_hub
26 days ago
model.safetensors.index.json
29.9 kB
Upload folder using huggingface_hub
26 days ago
special_tokens_map.json
330 Bytes
Upload folder using huggingface_hub
26 days ago
tokenizer.model
500 kB
LFS
Upload folder using huggingface_hub
26 days ago
tokenizer_config.json
940 Bytes
Upload folder using huggingface_hub
26 days ago