Update README.md
--- a/README.md
+++ b/README.md
@@ -10,21 +10,17 @@ license: apache-2.0
 language:
 - en
 ---
-
+## aqua-qwen-0.1-110B
 
-This
-
-
-### Merge Method
-
-This model was merged using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method using [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
+This model was created by merging 2 models using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method
+using [mergekit](https://github.com/arcee-ai/mergekit).
+The following models were included in the merge:
 
-
+- [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
+- [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
 
-The following models were included in the merge:
-* [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
 
-
+## Configuration
 
 The following YAML configuration was used to produce this model:
 
@@ -47,4 +43,8 @@ slices:
 layer_range: [0, 80]
 parameters:
 weight: 0.4
-```
+```
+
+## Usage
+
+It is recommended to use GGUF version of the model [available here](https://huggingface.co/saucam/aqua-qwen-0.1-110B-GGUF/blob/main/README.md)
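
The README names the linear DARE merge method but does not show what it does to the weights. The following is a minimal toy sketch of the idea on flat Python lists, not mergekit's implementation: each fine-tuned model's difference from the base is randomly sparsified, the surviving entries are rescaled, and the results are added to the base with a per-model weight. The function names, the `density` parameter (fraction of delta entries kept), and the seed are assumptions made for illustration; only the `0.4` weight appears in the configuration excerpt above.

```python
import random


def dare_delta(base, tuned, density, rng):
    """Return the drop-and-rescale delta of `tuned` relative to `base`."""
    if not 0.0 < density <= 1.0:
        raise ValueError("density must be in (0, 1]")
    out = []
    for b, t in zip(base, tuned):
        delta = t - b
        # Keep each delta entry with probability `density`, otherwise zero it.
        keep = rng.random() < density
        # Rescale kept entries by 1/density so the expected delta is unchanged.
        out.append(delta / density if keep else 0.0)
    return out


def dare_linear_merge(base, tuned_models, weights, density, seed=0):
    """Add the weighted, sparsified deltas of each tuned model to the base."""
    rng = random.Random(seed)
    merged = list(base)
    for tuned, weight in zip(tuned_models, weights):
        for i, d in enumerate(dare_delta(base, tuned, density, rng)):
            merged[i] += weight * d
    return merged


# Hypothetical toy "tensors"; real models hold billions of parameters.
base_weights = [1.0, 2.0, 3.0]
chat_weights = [2.0, 4.0, 3.0]
merged = dare_linear_merge(base_weights, [chat_weights], [0.4], density=0.5)
```

With `density=1.0` nothing is dropped, and the merge reduces to adding `weight` times the plain delta to the base. Lower densities zero out more of each delta while keeping its expected value the same.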