Commit
c74379d
1 Parent(s): f890203

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (ae74855612756d59f354eb2d4661807244c5b390)


Co-authored-by: Open LLM Leaderboard PR Bot <[email protected]>

Files changed (1) hide show
  1. README.md +18 -4
README.md CHANGED
@@ -1,17 +1,17 @@
1
  ---
2
- base_model: mistralai/Mistral-7B-v0.1
 
 
3
  tags:
4
  - mistral-7b
5
  - instruct
6
  - finetune
7
  - synthetic data
8
  - distillation
 
9
  model-index:
10
  - name: Mistral-Syndicate-7B
11
  results: []
12
- license: apache-2.0
13
- language:
14
- - en
15
  ---
16
 
17
  ## Mistral-Syndicate-7B
@@ -62,3 +62,17 @@ _12.30.23_
62
  | Winogrande | 78.61 |
63
  | GSM8K | 44.50 |
64
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
  tags:
6
  - mistral-7b
7
  - instruct
8
  - finetune
9
  - synthetic data
10
  - distillation
11
+ base_model: mistralai/Mistral-7B-v0.1
12
  model-index:
13
  - name: Mistral-Syndicate-7B
14
  results: []
 
 
 
15
  ---
16
 
17
  ## Mistral-Syndicate-7B
 
62
  | Winogrande | 78.61 |
63
  | GSM8K | 44.50 |
64
 
65
+
66
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
67
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_spmurrayzzz__Mistral-Syndicate-7B)
68
+
69
+ | Metric |Value|
70
+ |---------------------------------|----:|
71
+ |Avg. |61.90|
72
+ |AI2 Reasoning Challenge (25-Shot)|60.84|
73
+ |HellaSwag (10-Shot) |82.91|
74
+ |MMLU (5-Shot) |60.83|
75
+ |TruthfulQA (0-shot) |43.71|
76
+ |Winogrande (5-shot) |78.61|
77
+ |GSM8k (5-shot) |44.50|
78
+