RegularizedSelfPlay/sppo_reversekl-1.0-PromptABC-Mistral-7B-Instruct-SPPO-Iter8

No model card

Downloads last month: 6

Safetensors

Model size: 7.24B params

Tensor type: BF16
