Windy0822
/

PQM

Reinforcement Learning

Model card Files Files and versions Community

PQM

1 contributor

History: 9 commits

Windy0822's picture

Upload zeta-2/model.safetensors with huggingface_hub

4624df0 verified about 1 month ago

eval_data
Upload eval_data/math-llama3-70b-inst-128.json with huggingface_hub about 1 month ago
zeta-2
Upload zeta-2/model.safetensors with huggingface_hub about 1 month ago
zeta-4
Upload zeta-4/model.safetensors with huggingface_hub about 1 month ago
.gitattributes

1.98 kB

Upload eval_data/math-llama3-70b-inst-128.json with huggingface_hub about 1 month ago