Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Windy0822
/
PQM
like
0
Reinforcement Learning
Safetensors
peiyi9979/Math-Shepherd
English
License:
mit
Model card
Files
Files and versions
Community
4624df0
PQM
1 contributor
History:
9 commits
Windy0822
Upload zeta-2/model.safetensors with huggingface_hub
4624df0
verified
about 1 month ago
eval_data
Upload eval_data/math-llama3-70b-inst-128.json with huggingface_hub
about 1 month ago
zeta-2
Upload zeta-2/model.safetensors with huggingface_hub
about 1 month ago
zeta-4
Upload zeta-4/model.safetensors with huggingface_hub
about 1 month ago
.gitattributes
Safe
1.98 kB
Upload eval_data/math-llama3-70b-inst-128.json with huggingface_hub
about 1 month ago