RegularizedSelfPlay
/

sppo_reversekl-1.0-PromptABC-Mistral-7B-Instruct-SPPO-Iter2

Model card Files Files and versions Community

sppo_reversekl-1.0-PromptABC-Mistral-7B-Instruct-SPPO-Iter2

1 contributor

History: 3 commits

Sangwoong's picture

Upload tokenizer

5b94f88 verified 16 days ago