Adding Evaluation Results

Browse files

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1) hide show

README.md +29 -21

README.md CHANGED Viewed

@@ -1,16 +1,19 @@
 ---
 inference: false
 base_model:
 - SanjiWatsuki/Silicon-Maid-7B
 - sethuiyer/Aika-7B
 - sethuiyer/Nandine-7b
 - mlabonne/AlphaMonarch-7B
-library_name: transformers
-tags:
-- mergekit
-- merge
-- not-for-all-audiences
-license: cc
 model-index:
 - name: sethuiyer/Diana-7B
   results:
@@ -29,8 +32,7 @@ model-index:
       value: 68.34
       name: normalized accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -46,8 +48,7 @@ model-index:
       value: 86.73
       name: normalized accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -64,8 +65,7 @@ model-index:
       value: 64.58
       name: accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -81,8 +81,7 @@ model-index:
     - type: mc2
       value: 60.55
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -99,8 +98,7 @@ model-index:
       value: 80.19
       name: accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
@@ -117,12 +115,8 @@ model-index:
       value: 63.23
       name: accuracy
     source:
-      url: >-
-        https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
-language:
-- en
-pipeline_tag: text-generation
 ---
 # Diana-7B
@@ -198,3 +192,17 @@ GGUF files are available at [Diana-7B-GGUF](https://huggingface.co/sethuiyer/Dia
 Diana is now available on Ollama. You can use it by running the command ```ollama run stuehieyr/diana``` in your
 terminal. If you have limited computing resources, check out this [video](https://www.youtube.com/watch?v=Qa1h7ygwQq8) to learn how to run it on
 a Google Colab backend.

 ---
+language:
+- en
+license: cc
+library_name: transformers
+tags:
+- mergekit
+- merge
+- not-for-all-audiences
 inference: false
 base_model:
 - SanjiWatsuki/Silicon-Maid-7B
 - sethuiyer/Aika-7B
 - sethuiyer/Nandine-7b
 - mlabonne/AlphaMonarch-7B
+pipeline_tag: text-generation
 model-index:
 - name: sethuiyer/Diana-7B
   results:
       value: 68.34
       name: normalized accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
       value: 86.73
       name: normalized accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
       value: 64.58
       name: accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
     - type: mc2
       value: 60.55
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
       value: 80.19
       name: accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
   - task:
       type: text-generation
       value: 63.23
       name: accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Diana-7B
       name: Open LLM Leaderboard
 ---
 # Diana-7B
 Diana is now available on Ollama. You can use it by running the command ```ollama run stuehieyr/diana``` in your
 terminal. If you have limited computing resources, check out this [video](https://www.youtube.com/watch?v=Qa1h7ygwQq8) to learn how to run it on
 a Google Colab backend.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_sethuiyer__Diana-7B)
+|             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |70.60|
+|AI2 Reasoning Challenge (25-Shot)|68.34|
+|HellaSwag (10-Shot)              |86.73|
+|MMLU (5-Shot)                    |64.58|
+|TruthfulQA (0-shot)              |60.55|
+|Winogrande (5-shot)              |80.19|
+|GSM8k (5-shot)                   |63.23|