MarinaraSpaghetti commited on
Commit
3435b27
1 Parent(s): c993152

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +63 -39
README.md CHANGED
@@ -1,39 +1,63 @@
1
- ---
2
- base_model: []
3
- library_name: transformers
4
- tags:
5
- - mergekit
6
- - merge
7
-
8
- ---
9
- # Nemomix-v2.0-12B
10
-
11
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
12
-
13
- ## Merge Details
14
- ### Merge Method
15
-
16
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using F:\mergekit\mistralaiMistral-Nemo-Base-2407 as a base.
17
-
18
- ### Models Merged
19
-
20
- The following models were included in the merge:
21
- * F:\mergekit\intervitens_mini-magnum-12b-v1.1
22
- * F:\mergekit\NeverSleep_Lumimaid-v0.2-12B
23
- * F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
24
-
25
- ### Configuration
26
-
27
- The following YAML configuration was used to produce this model:
28
-
29
- ```yaml
30
- models:
31
- - model: F:\mergekit\NeverSleep_Lumimaid-v0.2-12B
32
- - model: F:\mergekit\intervitens_mini-magnum-12b-v1.1
33
- - model: F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
34
- merge_method: model_stock
35
- base_model: F:\mergekit\mistralaiMistral-Nemo-Base-2407
36
- parameters:
37
- filter_wise: false
38
- dtype: bfloat16
39
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: []
3
+ library_name: transformers
4
+ tags:
5
+ - mergekit
6
+ - merge
7
+
8
+ ---
9
+ # Description
10
+
11
+ My main goal is to merge the smartness of the base Instruct Nemo with the better prose from the different roleplaying fine-tunes. This is version v0.2, still to be tested. Not sure if it's better than v1.0.
12
+
13
+ # Instruct
14
+
15
+ Mistral Instruct.
16
+
17
+ ```
18
+ <s>[INST] {system} [/INST]{response}</s>[INST] {prompt} [/INST]
19
+ ```
20
+
21
+ # GGUF
22
+
23
+ https://huggingface.co/MarinaraSpaghetti/Nemomix-v2.0-12B-GGUF
24
+
25
+ # V1.0
26
+
27
+ https://huggingface.co/MarinaraSpaghetti/Nemomix-v0.1-12B
28
+
29
+ # Settings
30
+
31
+ Lower Temperature recommended, although I had luck with Temperatures above one (1.0-1.2) if you crank up the Min P (0.01-0.1). Run with base DRY of 0.8/1.75/2/0 and you're good to go.
32
+
33
+ # Nemomix-v2.0-12B
34
+
35
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
36
+
37
+ ## Merge Details
38
+ ### Merge Method
39
+
40
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using F:\mergekit\mistralaiMistral-Nemo-Base-2407 as a base.
41
+
42
+ ### Models Merged
43
+
44
+ The following models were included in the merge:
45
+ * F:\mergekit\intervitens_mini-magnum-12b-v1.1
46
+ * F:\mergekit\NeverSleep_Lumimaid-v0.2-12B
47
+ * F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
48
+
49
+ ### Configuration
50
+
51
+ The following YAML configuration was used to produce this model:
52
+
53
+ ```yaml
54
+ models:
55
+ - model: F:\mergekit\NeverSleep_Lumimaid-v0.2-12B
56
+ - model: F:\mergekit\intervitens_mini-magnum-12b-v1.1
57
+ - model: F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
58
+ merge_method: model_stock
59
+ base_model: F:\mergekit\mistralaiMistral-Nemo-Base-2407
60
+ parameters:
61
+ filter_wise: false
62
+ dtype: bfloat16
63
+ ```