Update README.md
README.md
CHANGED
@@ -76,6 +76,7 @@ I didn't run any sort of comprehensive set of benchmarks, but here are a couple
 | model | score |
 | --- | --- |
 | bagel-dpo-8x7b-v0.2 | __0.7242__ |
+| mixtral-8x7b-instruct-v0.1 | 0.6498 |
 | bagel-8x7b-v0.2 | 0.5921 |
 
 ### GSM8K
@@ -100,6 +101,12 @@ index ccf6a5a3..df0b7422 100644
 filter_list:
 ```
 
+| model | score |
+| --- | --- |
+| bagel-dpo-8x7b-v0.2 | |
+| mixtral-8x7b-instruct-v0.1 | |
+| bagel-8x7b-v0.2 | 0.5360 |
+
 ### Data sources
 
 *Yes, you will see benchmark names in the list, but this only uses the train splits, and a decontamination by cosine similarity is performed at the end as a sanity check*
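
For readers unfamiliar with the decontamination step mentioned in the note above, here is a rough sketch of what a cosine-similarity filter can look like. It is not the actual script used for this model. The README does not say which vectors or threshold are used, so the whitespace token counts, the `0.95` threshold, and the names `vectorize` and `decontaminate` are all assumptions made for illustration.

```python
import math
from collections import Counter


def vectorize(text: str) -> Counter:
    # Assumption: lowercase whitespace token counts stand in for whatever
    # embedding the real pipeline uses.
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def decontaminate(train_items, benchmark_items, threshold=0.95):
    """Keep only training items below the threshold against every benchmark item.

    The threshold value is hypothetical; the README does not state one.
    """
    bench_vectors = [vectorize(text) for text in benchmark_items]
    kept = []
    for item in train_items:
        vec = vectorize(item)
        if all(cosine_similarity(vec, bv) < threshold for bv in bench_vectors):
            kept.append(item)
    return kept
```

Calling `decontaminate(train_texts, benchmark_test_texts)` returns the training items that are not near-duplicates of any benchmark item. Comparing every pair is quadratic, so a real pipeline over large datasets would likely need dense embeddings and an approximate nearest-neighbour index instead.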