leaderboard-pr-bot commited on
Commit
1d59908
1 Parent(s): 5d75710

Adding Evaluation Results

Browse files

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1) hide show
  1. README.md +18 -5
README.md CHANGED
@@ -1,15 +1,15 @@
1
  ---
2
  license: other
 
 
 
 
3
  license_name: gemma-terms-of-use
4
  license_link: https://ai.google.dev/gemma/terms
5
  base_model: google/gemma-2b
6
- tags:
7
- - full
8
  model-index:
9
  - name: Gemma-2B
10
  results: []
11
- datasets:
12
- - sarvamai/samvaad-hi-v1
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -42,4 +42,17 @@ The following hyperparameters were used during training:
42
  - Transformers 4.39.0.dev0
43
  - Pytorch 2.0.1+cu118
44
  - Datasets 2.16.1
45
- - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: other
3
+ tags:
4
+ - full
5
+ datasets:
6
+ - sarvamai/samvaad-hi-v1
7
  license_name: gemma-terms-of-use
8
  license_link: https://ai.google.dev/gemma/terms
9
  base_model: google/gemma-2b
 
 
10
  model-index:
11
  - name: Gemma-2B
12
  results: []
 
 
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
42
  - Transformers 4.39.0.dev0
43
  - Pytorch 2.0.1+cu118
44
  - Datasets 2.16.1
45
+ - Tokenizers 0.15.0
46
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
47
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Tensoic__Gemma-2B-Samvaad)
48
+
49
+ | Metric |Value|
50
+ |---------------------------------|----:|
51
+ |Avg. |42.55|
52
+ |AI2 Reasoning Challenge (25-Shot)|46.59|
53
+ |HellaSwag (10-Shot) |68.17|
54
+ |MMLU (5-Shot) |33.09|
55
+ |TruthfulQA (0-shot) |39.95|
56
+ |Winogrande (5-shot) |61.64|
57
+ |GSM8k (5-shot) | 5.84|
58
+