ndhananj committed on
Commit
8bb8ffe
1 Parent(s): 32173b5

Update README.md

Make the model card for descriptions.

Files changed (1)
  1. README.md +21 -1
README.md CHANGED
@@ -6,10 +6,30 @@ tags: []
  # Model Card for Model ID

  <!-- Provide a quick summary of what the model is/does. -->
-


  ## Model Details

  ### Model Description

  # Model Card for Model ID

  <!-- Provide a quick summary of what the model is/does. -->
+ This model uses Llama-3.2-1B-Instruct as a base. On the HellaSwag benchmark it scores higher than the same fine-tuning applied to EleutherAI/gpt-neo-1.3B (zero-shot acc_norm 0.6072 vs. 0.4891).


  ## Model Details
+ # Model Card
+
+ ## Model Description
+
+ This is an ORPO fine-tune of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the [mlabonne/orpo-dpo-mix-40k](https://huggingface.co/datasets/mlabonne/orpo-dpo-mix-40k) dataset.
+
+ ## Evaluation Results
+
+ ### HellaSwag for this model
+ | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
+ |---------|------:|------|-----:|--------|---|-----:|---|-----:|
+ |hellaswag| 1|none | 0|acc |↑ |0.4501|± |0.0050|
+ | | |none | 0|acc_norm|↑ |0.6072|± |0.0049|
+
+ ### HellaSwag for the same fine-tuning of EleutherAI/gpt-neo-1.3B
+
+ | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
+ |---------|------:|------|-----:|--------|---|-----:|---|-----:|
+ |hellaswag| 1|none | 0|acc |↑ |0.3853|± |0.0049|
+ | | |none | 0|acc_norm|↑ |0.4891|± |0.0050|

  ### Model Description
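
To make the comparison between the two HellaSwag tables concrete, the gap can be worked out directly from the reported values. The snippet below is a minimal sketch, not part of the commit: the names `SCORES`, `compare`, and `summarize` are hypothetical, and the only inputs are the four zero-shot `acc` and `acc_norm` values copied from the tables in the diff.

```python
# HellaSwag zero-shot scores copied from the two tables in the diff.
# The dictionary layout and the key names are illustrative assumptions.
SCORES = {
    "acc": {"llama": 0.4501, "gpt_neo": 0.3853},
    "acc_norm": {"llama": 0.6072, "gpt_neo": 0.4891},
}


def compare(llama: float, gpt_neo: float) -> tuple[float, float]:
    """Return (absolute gain, relative gain) of the Llama fine-tune over GPT-Neo."""
    absolute = llama - gpt_neo
    relative = absolute / gpt_neo
    return absolute, relative


def summarize(scores=SCORES) -> dict:
    """Apply compare() to every metric in the score table."""
    return {
        metric: compare(values["llama"], values["gpt_neo"])
        for metric, values in scores.items()
    }


if __name__ == "__main__":
    for metric, (abs_gain, rel_gain) in summarize().items():
        print(f"{metric}: +{abs_gain:.4f} absolute, +{rel_gain:.1%} relative")
```

Run as a script, this reports a gain of about 0.0648 (roughly 16.8% relative) on `acc` and about 0.1181 (roughly 24.1% relative) on `acc_norm`. Both gaps are many times larger than the reported standard errors of about 0.005.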