saucam/llama-airo-3


saucam committed on Apr 21

Commit 4514eea (1 parent: f4b62a8)

Update README.md

Files changed (1)

README.md +17 -16


@@ -9,29 +9,18 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
-# Details
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the jondurbin/airoboros-3.2 dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.8437
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
@@ -64,4 +53,16 @@ The following hyperparameters were used during training:
 - Transformers 4.40.0.dev0
 - Pytorch 2.1.2+cu118
 - Datasets 2.15.0
-- Tokenizers 0.15.0

   results: []
 ---
+![](https://raw.githubusercontent.com/saucam/models/main/llama-aero.png)
+# llama-airo-3
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
+## Details
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the jondurbin/airoboros-3.2 dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.8437
 ## Training procedure
 ### Training hyperparameters
 - Transformers 4.40.0.dev0
 - Pytorch 2.1.2+cu118
 - Datasets 2.15.0
+- Tokenizers 0.15.0
+## Eval Results
+ |Benchmark|                          Model                           |agieval|gpt4all|bigbench|truthfulqa|Average|
+|---------|----------------------------------------------------------|------:|------:|-------:|---------:|------:|
+|nous     |[llama-airo-3](https://huggingface.co/saucam/llama-airo-3)|  36.59|  72.24|   39.26|      56.3|   51.1|
+ |Benchmark|                          Model                           |winogrande| arc |gsm8k|mmlu |truthfulqa|hellaswag|Average|
+|---------|----------------------------------------------------------|---------:|----:|----:|----:|---------:|--------:|------:|
+|openllm  |[llama-airo-3](https://huggingface.co/saucam/llama-airo-3)|     78.22|61.01|56.33|64.79|     56.35|    82.42|  66.52|
+Detailed Results: https://github.com/saucam/model_evals/tree/main/saucam/llama-airo-3
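
The `Average` column in the two eval tables added by this commit appears to be the plain arithmetic mean of the per-benchmark scores in each row. The sketch below recomputes both averages from the values in the tables as a sanity check. The dictionary and function names are illustrative and not part of the model repo, and the two-decimal rounding is an assumption about how the reported figures were produced.

```python
# Scores copied from the "Eval Results" tables in the README diff.
NOUS_SCORES = {
    "agieval": 36.59,
    "gpt4all": 72.24,
    "bigbench": 39.26,
    "truthfulqa": 56.3,
}

OPENLLM_SCORES = {
    "winogrande": 78.22,
    "arc": 61.01,
    "gsm8k": 56.33,
    "mmlu": 64.79,
    "truthfulqa": 56.35,
    "hellaswag": 82.42,
}


def mean_score(scores: dict[str, float]) -> float:
    """Return the unweighted mean of the benchmark scores, rounded to 2 decimals."""
    return round(sum(scores.values()) / len(scores), 2)


nous_average = mean_score(NOUS_SCORES)
openllm_average = mean_score(OPENLLM_SCORES)

print(f"nous average:    {nous_average}")
print(f"openllm average: {openllm_average}")
```

Both recomputed values should agree with the `Average` entries reported in the tables (51.1 and 66.52) to within rounding.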