Update README.md
Browse files
README.md
CHANGED
@@ -128,11 +128,12 @@ print(predict_score) # 1
|
|
128 |
```
|
129 |
|
130 |
### **Heatmap Visualize**
|
131 |
-
|
132 |
The training and test datasets are not separated, so we can only see how well the model learned.
|
133 |
-
|
134 |
-
- llama3-8b-it-prometheus-ko (korean train-> korean inference) # total 200 # in this [testset](https://huggingface.co/datasets/nayohan/feedback-collection-ko-chat/viewer/default/test)
|
135 |
|
|
|
|
|
136 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6152b4b9ecf3ca6ab820e325/ssZRGTysyiOZD4ttNOD4s.png)
|
137 |
|
138 |
### **Citation**
|
|
|
128 |
```
|
129 |
|
130 |
### **Heatmap Visualize**
|
131 |
+
For [eng->eng] we randomly sampled 200 evalset from the training data, extracted scores from the model-generated sentences, and compared them to the correct answers.
|
132 |
The training and test datasets are not separated, so we can only see how well the model learned.
|
133 |
+
For [ko->ko] sampled 200 evalset in this [testset](https://huggingface.co/datasets/nayohan/feedback-collection-ko-chat/viewer/default/test). llama3-8b-it-prometheus-ko only use trin set.
|
|
|
134 |
|
135 |
+
- prometheus-7b-v1.0 (english train-> english inference) # 3 failed to output a score, total 197
|
136 |
+
- llama3-8b-it-prometheus-ko (korean train-> korean inference) # total 200
|
137 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6152b4b9ecf3ca6ab820e325/ssZRGTysyiOZD4ttNOD4s.png)
|
138 |
|
139 |
### **Citation**
|