SUSTech/SUS-Chat-34B

@@ -7,10 +7,8 @@ widget:
     text: hi
     output:
       text: ' Hello! How can I assist you today?'
 pipeline_tag: text-generation
 ---
 # 🐷SUS-Chat: Instruction tuning done right
 <p align="left">
@@ -187,8 +185,7 @@ data-layout-align="center">
 </tr>
 <tr class="even">
 <td style="text-align: right;">SUS-Chat-34B</td>
-<td style="text-align: center;"><span
-class="math inline">$\underline{74.35}$</span></td>
 </tr>
 <tr class="odd">
 <td style="text-align: right;">Qwen-72b-Chat</td>
@@ -240,10 +237,8 @@ role="doc-noteref"><sup>1</sup></a></th>
 </tr>
 <tr class="odd">
 <td style="text-align: right;">Qwen-72b-Chat</td>
-<td style="text-align: center;"><span
-class="math inline">$\underline{77.02}$</span></td>
-<td style="text-align: center;"><span
-class="math inline">$\underline{77.22}$</span></td>
 </tr>
 <tr class="even">
 <td style="text-align: right;">Deepseek-68b-Chat</td>
@@ -280,25 +275,25 @@ role="doc-backlink">↩︎</a></p></li>
 ## Math & Reasoning
-|                 Model |   gsm8k (0-shot)    |    MATH (0-shot)    |    BBH (0-shot)     |
-|----------------------:|:-------------------:|:-------------------:|:-------------------:|
-|                 GPT-4 |        91.4         |        45.8         |        86.7         |
-|          SUS-Chat-34B |      **80.06**      |        28.7         |        67.62        |
-|         Qwen-72b-Chat | $\underline{76.57}$ |      **35.9**       |      **72.63**      |
-|     Deepseek-68b-Chat |        74.45        | $\underline{29.56}$ | $\underline{69.73}$ |
-| OrionStar-Yi-34B-Chat |        54.36        |        12.8         |        62.88        |
-|           Yi-34B-Chat |        63.76        |        10.02        |        61.54        |
 ## More Tasks
-|                 Model | winogrande (5-shot) |    arc (25-shot)    | hellaswag (10-shot) | TruthfulQA mc1 (0-shot) | TruthfulQA mc2 (0-shot) |
-|----------------------:|:-------------------:|:-------------------:|:-------------------:|:-----------------------:|:-----------------------:|
-|                 GPT-4 |          —          |        94.5         |        91.4         |          59.00          |            —            |
-|          SUS-Chat-34B |      **81.22**      | $\underline{81.54}$ |        83.79        |        **40.64**        |        **57.47**        |
-|         Qwen-72b-Chat |        76.09        |      **82.10**      | $\underline{86.06}$ |          39.17          |   $\underline{56.37}$   |
-|     Deepseek-68b-Chat | $\underline{80.58}$ |        81.29        |      **87.02**      |   $\underline{40.02}$   |          50.64          |
-| OrionStar-Yi-34B-Chat |        77.27        |        80.19        |        84.54        |          36.47          |          53.24          |
-|           Yi-34B-Chat |        76.64        |        70.66        |        82.29        |          38.19          |          54.57          |
 ## Overall
@@ -400,4 +395,4 @@ model.
 This model is developed entirely for academic research and free
 commercial use, but it must adhere to the
 [license](https://github.com/01-ai/Yi/blob/main/MODEL_LICENSE_AGREEMENT.txt)
-from [01-ai](https://huggingface.co/01-ai).

     text: hi
     output:
       text: ' Hello! How can I assist you today?'
 pipeline_tag: text-generation
 ---
 # 🐷SUS-Chat: Instruction tuning done right
 <p align="left">
 </tr>
 <tr class="even">
 <td style="text-align: right;">SUS-Chat-34B</td>
+<td style="text-align: center;"><u>74.35</u></td>
 </tr>
 <tr class="odd">
 <td style="text-align: right;">Qwen-72b-Chat</td>
 </tr>
 <tr class="odd">
 <td style="text-align: right;">Qwen-72b-Chat</td>
+<td style="text-align: center;"><u>77.02</u></td>
+<td style="text-align: center;"><u>77.22</u></td>
 </tr>
 <tr class="even">
 <td style="text-align: right;">Deepseek-68b-Chat</td>
 ## Math & Reasoning
+|                 Model | gsm8k (0-shot) | MATH (0-shot) | BBH (0-shot) |
+|----------------------:|:--------------:|:-------------:|:------------:|
+|                 GPT-4 |      91.4      |     45.8      |     86.7     |
+|          SUS-Chat-34B |   **80.06**    |     28.7      |    67.62     |
+|         Qwen-72b-Chat |  <u>76.57</u>  |   **35.9**    |  **72.63**   |
+|     Deepseek-68b-Chat |     74.45      | <u>29.56</u>  | <u>69.73</u> |
+| OrionStar-Yi-34B-Chat |     54.36      |     12.8      |    62.88     |
+|           Yi-34B-Chat |     63.76      |     10.02     |    61.54     |
 ## More Tasks
+|                 Model | winogrande (5-shot) | arc (25-shot) | hellaswag (10-shot) | TruthfulQA mc1 (0-shot) | TruthfulQA mc2 (0-shot) |
+|----------------------:|:-------------------:|:-------------:|:-------------------:|:-----------------------:|:-----------------------:|
+|                 GPT-4 |          —          |     94.5      |        91.4         |          59.00          |            —            |
+|          SUS-Chat-34B |      **81.22**      | <u>81.54</u>  |        83.79        |        **40.64**        |        **57.47**        |
+|         Qwen-72b-Chat |        76.09        |   **82.10**   |    <u>86.06</u>     |          39.17          |      <u>56.37</u>       |
+|     Deepseek-68b-Chat |    <u>80.58</u>     |     81.29     |      **87.02**      |      <u>40.02</u>       |          50.64          |
+| OrionStar-Yi-34B-Chat |        77.27        |     80.19     |        84.54        |          36.47          |          53.24          |
+|           Yi-34B-Chat |        76.64        |     70.66     |        82.29        |          38.19          |          54.57          |
 ## Overall
 This model is developed entirely for academic research and free
 commercial use, but it must adhere to the
 [license](https://github.com/01-ai/Yi/blob/main/MODEL_LICENSE_AGREEMENT.txt)
+from [01-ai](https://huggingface.co/01-ai).
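
The substitution visible in this diff (inline LaTeX `$\underline{...}$`, with or without a `<span class="math inline">` wrapper, replaced by an HTML `<u>...</u>` tag) could be reproduced with a short script. The following is only a minimal sketch under that assumption. The function name `underline_to_html` and both regular expressions are hypothetical and are not taken from the repository, and the sketch does not re-pad the pipe-table columns the way the diff does.

```python
import re

# Wrapped form seen in the HTML tables, e.g.
#   <span
#   class="math inline">$\underline{74.35}$</span>
# \s+ also matches the line break between "<span" and "class".
SPAN_MATH = re.compile(
    r'<span\s+class="math inline">\$\\underline\{([^{}]*)\}\$</span>'
)

# Bare form seen in the pipe tables, e.g. $\underline{76.57}$
BARE_MATH = re.compile(r'\$\\underline\{([^{}]*)\}\$')


def underline_to_html(text: str) -> str:
    """Replace LaTeX underline markup with an HTML <u> tag."""
    # Handle the wrapped form first so the <span> wrapper is removed too.
    text = SPAN_MATH.sub(r'<u>\1</u>', text)
    return BARE_MATH.sub(r'<u>\1</u>', text)


if __name__ == "__main__":
    cell = ('<td style="text-align: center;"><span\n'
            'class="math inline">$\\underline{74.35}$</span></td>')
    print(underline_to_html(cell))
    print(underline_to_html('| Qwen-72b-Chat | $\\underline{76.57}$ |'))
```

Bold cells such as `**80.06**` contain no `\underline` and pass through unchanged.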