v000000 committed on
Commit
c3ae0fc
1 Parent(s): 5715db0

Create README.md

  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
+ ---
+ base_model:
+ - nothingiisreal/L3.1-8B-Celeste-V1.5
+ - Sao10K/Llama-3.1-8B-Stheno-v3.4
+ - Sao10K/L3.1-8B-Niitama-v1.1
+ - arcee-ai/Llama-3.1-SuperNova-Lite
+ - akjindal53244/Llama-3.1-Storm-8B
+ - arcee-ai/Llama-Spark
+ - grimjim/Llama-3-Instruct-abliteration-LoRA-8B
+ - crestf411/sunfall-peft
+
+ tags:
+ - llama
+ - merge
+ - llama3
+ - mixtral
+ ---
+
+ # Llama-3.1-Celestial-Stone-2x8B (BF16)
+
+ * *Mixture of Experts (14B).*
+
+ Both experts are used in tandem when generating a token.
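
The card does not say how the two experts' outputs are combined. The following is a minimal sketch, assuming a Mixtral-style router (suggested only by the `mixtral` tag) in which every expert is active and the outputs are blended by softmax gate weights. The names `softmax` and `moe_combine` are hypothetical and are not taken from the model's code.

```python
import math


def softmax(logits):
    """Turn raw gate logits into weights that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def moe_combine(gate_logits, expert_outputs):
    """Blend the outputs of all experts using softmax gate weights.

    Assumption: with two experts and both selected for every token,
    the layer output is the gate-weighted sum of both expert outputs.
    """
    weights = softmax(gate_logits)
    dim = len(expert_outputs[0])
    return [
        sum(w * out[i] for w, out in zip(weights, expert_outputs))
        for i in range(dim)
    ]


# Toy example: equal gate logits give each expert a weight of 0.5.
mixed = moe_combine([0.0, 0.0], [[1.0, 2.0], [3.0, 4.0]])
print(mixed)  # [2.0, 3.0]
```

In a real model the gate logits come from a learned router applied to each token's hidden state, so the blend differs from token to token.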
+
+ ------------------------------------------------------------------------------
+
+ *The first expert* is an Instruct 405B distillation/RP vector merge with a bit of abliteration <b>(Supernova-Lite, Niitama1.1, Storm)</b>.
+
+ *The second expert* is an ERP/Reddit data merge with a bit of abliteration <b>(Celeste1.5, Stheno3.4, Storm)</b>.
+
+ -------------------------------------------------------------------------------
+
+ *The base model* is <b>Sao10K/Llama-3.1-8B-Stheno-v3.4</b> with the <b>Sunfall LoRA 0.6.1</b> applied, so that it understands SillyTavern prompts and storywriting better.
+
+ -------------------------------------------------------------------------------
+
+ # Prompt Template:
+ ```text
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
+
+ {system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
+
+ {input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+
+ {output}<|eot_id|>
+
+ ```
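
As a rough illustration of how the template above might be filled in by hand, here is a small sketch. The helper name `build_prompt` is hypothetical. It stops after the assistant header so the model can generate `{output}` itself. Some tokenizers and frontends add `<|begin_of_text|>` on their own; if yours does, drop it from the string to avoid a duplicate.

```python
def build_prompt(system_prompt: str, user_input: str) -> str:
    """Fill the Llama 3 style template, leaving the assistant turn open."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    )


prompt = build_prompt("You are a narrator.", "Describe the harbor at dawn.")
print(prompt)
```

Generation would normally be stopped at the first `<|eot_id|>` the model emits, which closes the assistant turn.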