Update README.md
Browse files
README.md
CHANGED
@@ -2,6 +2,9 @@
|
|
2 |
license: apache-2.0
|
3 |
datasets:
|
4 |
- argilla/distilabel-intel-orca-dpo-pairs
|
|
|
|
|
|
|
5 |
---
|
6 |
|
7 |
# solarized-18B-dpo
|
@@ -9,4 +12,4 @@ datasets:
|
|
9 |
DPO'd from vicgalle/franken-SOLAR-18B-v1.0, a SOLAR-like model upscaled to 18B.
|
10 |
It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.
|
11 |
|
12 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)
|
|
|
2 |
license: apache-2.0
|
3 |
datasets:
|
4 |
- argilla/distilabel-intel-orca-dpo-pairs
|
5 |
+
tags:
|
6 |
+
- dpo
|
7 |
+
- 18B
|
8 |
---
|
9 |
|
10 |
# solarized-18B-dpo
|
|
|
12 |
DPO'd from vicgalle/franken-SOLAR-18B-v1.0, a SOLAR-like model upscaled to 18B.
|
13 |
It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.
|
14 |
|
15 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)
|