snorkelai/Snorkel-Mistral-PairRM-DPO


viethoangtranduong committed on Jan 22

Commit e6e8d18 • 1 Parent(s): ccbadf0

Update README.md

Files changed (1)

README.md +2 -2

@@ -44,8 +44,8 @@ to learn more about "Programmatically scale human preferences and alignment in G
 #### Result:
-- This model scored **30.2** on [Alpaca-Eval 2.0](https://tatsu-lab.github.io/alpaca_eval/) - ranked #4 and the highest for an open source base model at the time of publication.
-- Utilizing the model with PairRM, which involved generating 16 responses and submitting the highest-scoring one by PairRM, we scored **34.86** - ranked #2.
+- This model scored **30.2** on [Alpaca-Eval 2.0](https://tatsu-lab.github.io/alpaca_eval/) - ranked 3rd and the highest for an open source base model at the time of publication.
+- Utilizing the model with PairRM, which involved generating 16 responses and submitting the highest-scoring one by PairRM, we scored **34.86** - ranked 2nd.
 The best model on the leaderboard is "gpt-4-turbo".
 We acknowledge that Alpaca-Eval 2.0 is not the full reflection of LLMs' performances.
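
The second bullet in this hunk describes generating 16 responses and submitting the one PairRM scores highest. A minimal sketch of that best-of-n selection step follows. It is an illustration under assumptions, not the authors' pipeline: `generate` and `score` are hypothetical callables standing in for the model and the reward model, and a single scalar score per response is a simplification, since PairRM ranks candidates by pairwise comparison.

```python
from typing import Callable, List


def best_of_n(
    prompt: str,
    generate: Callable[[str], str],      # hypothetical: samples one response
    score: Callable[[str, str], float],  # hypothetical: higher is better
    n: int = 16,
) -> str:
    """Sample n candidate responses and return the highest-scoring one."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    # max() returns the first candidate among ties, so the result is
    # deterministic whenever generate and score are deterministic.
    return max(candidates, key=lambda response: score(prompt, response))
```

With real models, `generate` would sample from the language model with a non-zero temperature so the 16 candidates differ, and `score` would wrap whatever ranking the reward model produces.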