LemiSt commited on
Commit
d032d3e
1 Parent(s): 1a065d9

added evaluation results

Browse files
Files changed (1) hide show
  1. README.md +43 -1
README.md CHANGED
@@ -7,7 +7,49 @@ tags:
7
  - generated_from_trainer
8
  model-index:
9
  - name: SmolLM-135M-instruct-de-merged
10
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  language:
12
  - de
13
  pipeline_tag: text-generation
 
7
  - generated_from_trainer
8
  model-index:
9
  - name: SmolLM-135M-instruct-de-merged
10
+ results:
11
+ - task:
12
+ type: text-generation
13
+ dataset:
14
+ name: openai/MMMLU
15
+ type: mmlu
16
+ metrics:
17
+ - name: MMMLU(DE_DE) (0-Shot)
18
+ type: MMMLU(DE_DE) (0-Shot)
19
+ value: 25.57
20
+ verified: false
21
+ - task:
22
+ type: text-generation
23
+ dataset:
24
+ name: alexandrainst/m_arc
25
+ type: arc
26
+ metrics:
27
+ - name: ARC Challenge (DE) (0-Shot)
28
+ type: ARC Challenge (DE) (0-Shot)
29
+ value: 24.29
30
+ verified: false
31
+ - task:
32
+ type: text-generation
33
+ dataset:
34
+ name: deutsche-telekom/Ger-RAG-eval
35
+ type: Ger-RAG-eval
36
+ metrics:
37
+ - name: Ger-RAG-eval Choose Context By Question
38
+ type: Ger-RAG-eval Task 1
39
+ value: 25.2
40
+ verified: false
41
+ - name: Ger-RAG-eval Choose Question By Context
42
+ type: Ger-RAG-eval Task 2
43
+ value: 27.1
44
+ verified: false
45
+ - name: Ger-RAG-eval Context Question Match
46
+ type: Ger-RAG-eval Task 3
47
+ value: 50.9
48
+ verified: false
49
+ - name: Ger-RAG-eval Question Answer Match
50
+ type: Ger-RAG-eval Task 4
51
+ value: 50.0
52
+ verified: false
53
  language:
54
  - de
55
  pipeline_tag: text-generation