Adding Evaluation Results

#1
Files changed (1)
  1. README.md +19 -6
README.md CHANGED
@@ -1,20 +1,20 @@
 ---
+language:
+- nl
 license: mit
-base_model: microsoft/phi-2
 tags:
 - trl
 - fietje
 - alignment-handbook
+base_model: microsoft/phi-2
 datasets:
 - uonlp/CulturaX
 - wikimedia/wikipedia
+pipeline_tag: text-generation
+inference: false
 model-index:
 - name: fietje-2
   results: []
-language:
-- nl
-pipeline_tag: text-generation
-inference: false
 ---
 
 
@@ -91,4 +91,17 @@ The following hyperparameters were used during training:
 - Transformers 4.39.1
 - Pytorch 2.1.2+cu121
 - Datasets 2.18.0
-- Tokenizers 0.15.2
+- Tokenizers 0.15.2
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_BramVanroy__fietje-2)
+
+|      Metric       |Value|
+|-------------------|----:|
+|Avg.               | 9.03|
+|IFEval (0-Shot)    |20.98|
+|BBH (3-Shot)       |15.60|
+|MATH Lvl 5 (4-Shot)| 0.91|
+|GPQA (0-shot)      | 0.56|
+|MuSR (0-shot)      | 5.16|
+|MMLU-PRO (5-shot)  |10.95|
+
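
As a quick sanity check on the results table added in this diff, the sketch below recomputes the Avg. row. It assumes (the PR does not state this) that Avg. is the unweighted arithmetic mean of the six benchmark scores, rounded to two decimals. The names `scores` and `mean_score` are illustrative only and do not come from the leaderboard tooling.

```python
# Scores copied verbatim from the table added to README.md in this PR.
scores = {
    "IFEval (0-Shot)": 20.98,
    "BBH (3-Shot)": 15.60,
    "MATH Lvl 5 (4-Shot)": 0.91,
    "GPQA (0-shot)": 0.56,
    "MuSR (0-shot)": 5.16,
    "MMLU-PRO (5-shot)": 10.95,
}


def mean_score(values):
    """Unweighted arithmetic mean.

    Assumption: this is how the Avg. row is derived; the PR itself
    does not document the aggregation.
    """
    values = list(values)
    return sum(values) / len(values)


average = round(mean_score(scores.values()), 2)
print(f"Avg. = {average:.2f}")
```

Under that assumption the recomputed value agrees with the 9.03 reported in the table, which suggests the six rows and the average are mutually consistent.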