SongTonyLi
/

gemma-2b-it-SFT-D1_chosen-then-DPO-D2a-distilabel-math-preference

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

gemma-2b-it-SFT-D1_chosen-then-DPO-D2a-distilabel-math-preference

1 contributor

History: 3 commits

SongTonyLi's picture

Upload tokenizer

60170ff verified 2 months ago