Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
radlab
/
pLLama3.2-1B-DPO
like
0
Follow
Research And Development Laboratory
5
Safetensors
4 languages
llama
License:
llama3.2
Model card
Files
Files and versions
Community
Train
2148936
pLLama3.2-1B-DPO
1 contributor
History:
4 commits
pkedzia
Update README.md
2148936
verified
about 1 month ago
.gitattributes
Safe
1.57 kB
Upload 6 files
about 1 month ago
README.md
Safe
83 Bytes
Update README.md
about 1 month ago
config.json
Safe
962 Bytes
Upload 6 files
about 1 month ago
generation_config.json
Safe
184 Bytes
Upload 6 files
about 1 month ago
model.safetensors
Safe
2.47 GB
LFS
Upload 6 files
about 1 month ago
special_tokens_map.json
Safe
296 Bytes
Upload 6 files
about 1 month ago
tokenizer.json
Safe
17.2 MB
LFS
Upload 6 files
about 1 month ago
tokenizer_config.json
Safe
54.6 kB
Upload 6 files
about 1 month ago