Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
rinna
/
bilingual-gpt-neox-4b-instruction-ppo
like
15
Follow
rinna Co., Ltd.
88
Text Generation
Transformers
PyTorch
Safetensors
Anthropic/hh-rlhf
Japanese
English
gpt_neox
text-generation-inference
arxiv:
2203.02155
arxiv:
1707.06347
arxiv:
2404.01657
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
937933b
bilingual-gpt-neox-4b-instruction-ppo
3 contributors
History:
7 commits
keisawada
Update README.md
937933b
verified
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
README.md
Safe
8.53 kB
Update README.md
2 months ago
config.json
Safe
641 Bytes
update
over 1 year ago
model.safetensors
Safe
7.74 GB
LFS
Adding `safetensors` variant of this model (#1)
about 1 year ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (4)
"torch.BoolStorage"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
7.78 GB
LFS
update
over 1 year ago
rinna.png
Safe
60.3 kB
update
over 1 year ago
spiece.model
Safe
1.34 MB
LFS
update
over 1 year ago
spiece.vocab
Safe
1.16 MB
update
over 1 year ago
tokenizer_config.json
Safe
284 Bytes
update
over 1 year ago