saishf committed on
Commit
790ec48
1 Parent(s): e3e7828

Update README.md

  1. README.md +47 -44
README.md CHANGED
@@ -1,44 +1,47 @@
- ---
- base_model:
- - saishf/Long-Neural-SOVLish-Devil-8B-L3-262K
- - saishf/Merge-Mayhem-L3-V2
- - saishf/Neural-SOVLish-Devil-8B-L3
- - saishf/SOVLish-Maid-L3-8B
- - saishf/Merge-Mayhem-L3-V2.1
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # merge
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [saishf/Long-Neural-SOVLish-Devil-8B-L3-262K](https://huggingface.co/saishf/Long-Neural-SOVLish-Devil-8B-L3-262K) as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [saishf/Merge-Mayhem-L3-V2](https://huggingface.co/saishf/Merge-Mayhem-L3-V2)
- * [saishf/Neural-SOVLish-Devil-8B-L3](https://huggingface.co/saishf/Neural-SOVLish-Devil-8B-L3)
- * [saishf/SOVLish-Maid-L3-8B](https://huggingface.co/saishf/SOVLish-Maid-L3-8B)
- * [saishf/Merge-Mayhem-L3-V2.1](https://huggingface.co/saishf/Merge-Mayhem-L3-V2.1)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- models:
- - model: saishf/Neural-SOVLish-Devil-8B-L3
- - model: saishf/Merge-Mayhem-L3-V2
- - model: saishf/Merge-Mayhem-L3-V2.1
- - model: saishf/SOVLish-Maid-L3-8B
- merge_method: model_stock
- base_model: saishf/Long-Neural-SOVLish-Devil-8B-L3-262K
- dtype: bfloat16
- ```
+ ---
+ base_model:
+ - saishf/Long-Neural-SOVLish-Devil-8B-L3-262K
+ - saishf/Merge-Mayhem-L3-V2
+ - saishf/Neural-SOVLish-Devil-8B-L3
+ - saishf/SOVLish-Maid-L3-8B
+ - saishf/Merge-Mayhem-L3-V2.1
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ license: cc-by-nc-4.0
+ ---
+ # merge
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ **Experimental**
+
+ This model is an attempt to push [saishf/SOVL-Mega-Mash-V2-L3-8B](https://huggingface.co/saishf/SOVL-Mega-Mash-V2-L3-8B) (my personal favourite model) to support 32K+ context.
+ ### Merge Method
+
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [saishf/Long-Neural-SOVLish-Devil-8B-L3-262K](https://huggingface.co/saishf/Long-Neural-SOVLish-Devil-8B-L3-262K) as a base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [saishf/Merge-Mayhem-L3-V2](https://huggingface.co/saishf/Merge-Mayhem-L3-V2)
+ * [saishf/Neural-SOVLish-Devil-8B-L3](https://huggingface.co/saishf/Neural-SOVLish-Devil-8B-L3)
+ * [saishf/SOVLish-Maid-L3-8B](https://huggingface.co/saishf/SOVLish-Maid-L3-8B)
+ * [saishf/Merge-Mayhem-L3-V2.1](https://huggingface.co/saishf/Merge-Mayhem-L3-V2.1)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+ - model: saishf/Neural-SOVLish-Devil-8B-L3
+ - model: saishf/Merge-Mayhem-L3-V2
+ - model: saishf/Merge-Mayhem-L3-V2.1
+ - model: saishf/SOVLish-Maid-L3-8B
+ merge_method: model_stock
+ base_model: saishf/Long-Neural-SOVLish-Devil-8B-L3-262K
+ dtype: bfloat16
+ ```
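
For readers unfamiliar with the `model_stock` merge method named in the configuration, here is a minimal sketch of the interpolation rule as I understand it from the Model Stock paper, applied to a single flattened weight tensor. It is not mergekit's implementation. The function name `model_stock_merge` and the toy inputs are hypothetical, and using the average pairwise cosine between task vectors as the angle estimate is an assumption of this sketch.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Merge one flattened weight tensor (hypothetical helper, illustration only).

    base      -- list of floats, the base model's weights
    finetuned -- list of k >= 2 weight lists, one per fine-tuned model
    Returns (merged_weights, t) where t is the interpolation ratio.
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")

    # Task vectors: each fine-tuned model's offset from the base weights.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm > 0 else 0.0

    # Assumption: estimate the angle by averaging all pairwise cosines.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio: t = k*cos / (1 + (k - 1)*cos).
    denom = 1 + (k - 1) * cos_theta
    if denom == 0:
        raise ValueError("degenerate angle: interpolation ratio is undefined")
    t = k * cos_theta / denom

    # Pull the plain average of the fine-tuned weights toward the base.
    avg = [sum(col) / k for col in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(avg, base)]
    return merged, t


if __name__ == "__main__":
    merged, t = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
    print(merged, t)
```

When the task vectors are orthogonal the ratio is 0 and the result falls back to the base weights. When they coincide the ratio is 1 and the result is the plain average of the fine-tuned weights.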