Update README.md
# cybertron-v4-qw7B-UNAMGS

**UNA IS BACK** Cybertron v4 UNA-MGS, based on the amazing Qwen2.5 7B.

**SCORING #1 7-8B LLM WITH NO CONTAMINATION (21.11.2024, avg. 31.82)**

![cybertron-v4-MGS](https://huggingface.co/fblgit/cybertron-v4-qw7B-MGS/resolve/main/cybertron_v4MGS.png)

Here we use our novel approach called `MGS`. It's up to you to figure out what it means.

Cybertron V4 went through SFT with `MGS & UNA` over the `Magpie-Align/Magpie-Qwen2.5-Pro-1M-v0.1` dataset.
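
MGS and UNA themselves are not published, so the exact recipe can't be reproduced from this card alone. As a rough point of reference, a plain TRL SFT pass over the same dataset might look like the sketch below; the `output_dir`, any dataset reformatting, and the training arguments are assumptions, not the author's actual setup:

```python
# Rough SFT baseline sketch -- MGS & UNA are the author's own (unpublished)
# techniques, so only the plain supervised fine-tuning part is shown.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Depending on the TRL version, the chat column may need remapping to the
# "messages" format first; check the dataset card for its exact schema.
dataset = load_dataset("Magpie-Align/Magpie-Qwen2.5-Pro-1M-v0.1", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-7B",                           # base model named in this card
    train_dataset=dataset,
    args=SFTConfig(output_dir="unamgs-sft-baseline"),  # assumed output path
)
trainer.train()
```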

## Contamination Benchmark

- MATH (5-gram overlap with the MATH train/test splits):

```
5gram-Qwen2.5-7B-Instruct-orgn-MATH-test.jsonl: 37.52666666666667
5gram-Qwen2.5-7B-Instruct-orgn-MATH-train.jsonl: 46.36666666666667
```

vs

```
5gram-UNA-cybertron-v4-qw7B-MGS-orgn-MATH-test.jsonl: 37.42666666666667
5gram-UNA-cybertron-v4-qw7B-MGS-orgn-MATH-train.jsonl: 46.053333333333335
```
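
The card doesn't include the script behind these numbers, but the filenames point to a 5-gram overlap check between each model's data and the MATH splits. A minimal sketch of that kind of measurement; the helper names and whitespace tokenization are assumptions, not the author's actual tooling:

```python
# Sketch of a 5-gram overlap measurement: for each candidate sample, the
# percentage of its 5-grams that also occur anywhere in the reference split,
# averaged over all samples.
def ngrams(tokens, n=5):
    """Set of all contiguous n-grams in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def overlap_percent(candidates, reference_texts, n=5):
    reference = set()
    for text in reference_texts:
        reference |= ngrams(text.split(), n)
    scores = [
        100 * len(grams & reference) / len(grams)
        for grams in (ngrams(text.split(), n) for text in candidates)
        if grams
    ]
    return sum(scores) / len(scores)
```

Comparable scores for the fine-tune and its base model (as shown above) indicate the fine-tuning data added no measurable contamination.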
## Quantz

Soon...
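
Until official quants are posted, one way to run the model in reduced precision is an on-the-fly 4-bit load with bitsandbytes; a sketch, assuming a CUDA GPU with enough memory for the 7B weights in 4-bit:

```python
# On-the-fly 4-bit load via bitsandbytes; not an official quant release.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "fblgit/cybertron-v4-qw7B-UNAMGS"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
```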
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__cybertron-v4-qw7B-UNAMGS).

| Metric             |Value|
|--------------------|----:|
|Avg.                |31.82|
|IFEval (0-Shot)     |60.84|
|BBH (3-Shot)        |37.71|
|MATH Lvl 5 (4-Shot) |29.91|
|GPQA (0-shot)       |10.85|
|MuSR (0-shot)       |12.69|
|MMLU-PRO (5-shot)   |38.89|

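For programmatic access, the per-task details can be pulled from the linked dataset. The config name and split below are illustrative placeholders, not confirmed by this card; check the dataset card for the exact names available:

```python
# Pull per-task leaderboard details; config/split names vary by repo, so
# treat the config and "latest" below as placeholders to verify.
from datasets import load_dataset

details = load_dataset(
    "open-llm-leaderboard/details_fblgit__cybertron-v4-qw7B-UNAMGS",
    "fblgit__cybertron-v4-qw7B-UNAMGS__leaderboard_ifeval",  # illustrative config
    split="latest",
)
```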
## MGS & UNA & Details

* MGS, `1+1 = 2 and not 3`

journal={arXiv preprint arXiv:2407.10671},
year={2024}
}
```