PEFT
PyTorch
Safetensors
llama
Generated from Trainer
mtasic85 leaderboard-pr-bot commited on
Commit
4ffb38e
1 Parent(s): 5c229e2

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (b077c168ef208cd33e91ba161da868f59121c3f6)


Co-authored-by: Open LLM Leaderboard PR Bot <[email protected]>

Files changed (1) hide show
  1. README.md +16 -3
README.md CHANGED
@@ -1,9 +1,9 @@
1
  ---
2
- base_model: pints-ai/1.5-Pints-16K-v0.1
3
- library_name: peft
4
  license: mit
 
5
  tags:
6
  - generated_from_trainer
 
7
  model-index:
8
  - name: tangledgroup/tangled-llama-pints-1.5b-v0.2-instruct
9
  results: []
@@ -156,4 +156,17 @@ The following hyperparameters were used during training:
156
  - Transformers 4.45.0.dev0
157
  - Pytorch 2.4.1
158
  - Datasets 2.21.0
159
- - Tokenizers 0.19.1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
 
2
  license: mit
3
+ library_name: peft
4
  tags:
5
  - generated_from_trainer
6
+ base_model: pints-ai/1.5-Pints-16K-v0.1
7
  model-index:
8
  - name: tangledgroup/tangled-llama-pints-1.5b-v0.2-instruct
9
  results: []
 
156
  - Transformers 4.45.0.dev0
157
  - Pytorch 2.4.1
158
  - Datasets 2.21.0
159
+ - Tokenizers 0.19.1
160
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
161
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_tangledgroup__tangled-llama-pints-1.5b-v0.2-instruct)
162
+
163
+ | Metric |Value|
164
+ |-------------------|----:|
165
+ |Avg. | 4.66|
166
+ |IFEval (0-Shot) |17.24|
167
+ |BBH (3-Shot) | 4.08|
168
+ |MATH Lvl 5 (4-Shot)| 0.76|
169
+ |GPQA (0-shot) | 0.00|
170
+ |MuSR (0-shot) | 4.57|
171
+ |MMLU-PRO (5-shot) | 1.30|
172
+