saucam
/

aqua-qwen-0.1-110B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

saucam commited on May 14

Commit

b292d08

•

1 Parent(s): b740c4e

Update README.md

Files changed (1) hide show

README.md +12 -12

README.md CHANGED Viewed

@@ -10,21 +10,17 @@ license: apache-2.0
 language:
 - en
 ---
-# merge
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method using [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
-### Models Merged
-The following models were included in the merge:
-* [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
-### Configuration
 The following YAML configuration was used to produce this model:
@@ -47,4 +43,8 @@ slices:
       layer_range: [0, 80]
       parameters:
         weight: 0.4
-```

 language:
 - en
 ---
+## aqua-qwen-0.1-110B
+This model was created by merging 2 models using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method
+using [mergekit](https://github.com/arcee-ai/mergekit).
+The following models were included in the merge:
+- [cognitivecomputations/dolphin-2.9.1-qwen-110b](https://huggingface.co/cognitivecomputations/dolphin-2.9.1-qwen-110b) as a base.
+- [Qwen/Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
+## Configuration
 The following YAML configuration was used to produce this model:
       layer_range: [0, 80]
       parameters:
         weight: 0.4
+```
+## Usage
+It is recommended to use GGUF version of the model [available here](https://huggingface.co/saucam/aqua-qwen-0.1-110B-GGUF/blob/main/README.md)