Clevyby Lewdiculous commited on
Commit
42ab19f
0 Parent(s):

Duplicate from Lewdiculous/Llama-3-Lumimaid-8B-v0.1-OAS-GGUF-IQ-Imatrix

Browse files

Co-authored-by: Lewdiculous <[email protected]>

.gitattributes ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ imatrix.dat filter=lfs diff=lfs merge=lfs -text
37
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q4_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q4_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q5_K_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q5_K_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q6_K-imat.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Llama-3-Lumimaid-8B-v0.1-OAS-Q8_0-imat.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Llama-3-Lumimaid-8B-v0.1-OAS-F16.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_M-imat.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_S-imat.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_XXS-imat.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Llama-3-Lumimaid-8B-v0.1-OAS-IQ4_NL-imat.gguf filter=lfs diff=lfs merge=lfs -text
48
+ Llama-3-Lumimaid-8B-v0.1-OAS-IQ4_XS-imat.gguf filter=lfs diff=lfs merge=lfs -text
Llama-3-Lumimaid-8B-v0.1-OAS-F16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f3c9f10efe36bacf6feed9987bcc7862895148e546dadadd8edf4b5912fb975
3
+ size 16068890432
Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_M-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:67ceef34e89ebeafcbd3e326d55503a324788d9723a7854e6a8fbba0ca503cc4
3
+ size 3784822976
Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_S-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6377a1ea708ed9bd5a5fa799e70359f4090164a657adc79109f53a5b37572488
3
+ size 3682324672
Llama-3-Lumimaid-8B-v0.1-OAS-IQ3_XXS-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fec41a66fc849baa9ed963f60c8b45602f3cb6841fc605d1132f1527639f4c53
3
+ size 3274911936
Llama-3-Lumimaid-8B-v0.1-OAS-IQ4_NL-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ed9af1032ca66320b8d1a6c072efdfa356bdbd9c5682a8966c29d1e20949f0f
3
+ size 4677988544
Llama-3-Lumimaid-8B-v0.1-OAS-IQ4_XS-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a62e790d2593df40910a9535f1bd5ecff208f203ad5fcf86a1e9c91dc8221fbb
3
+ size 4447662272
Llama-3-Lumimaid-8B-v0.1-OAS-Q4_K_M-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1199440aa13c55f5f2cad1cb215535306f21e52a81de23f80a9e3586c8ac1c50
3
+ size 4920733888
Llama-3-Lumimaid-8B-v0.1-OAS-Q4_K_S-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6cec716e1544579dd291bed88e9898c759418caf1bf807125de312376c08cea3
3
+ size 4692668608
Llama-3-Lumimaid-8B-v0.1-OAS-Q5_K_M-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af568492efc49e32da42fd88310ad3a523a3baedc6c3b014f19ff21afe343b72
3
+ size 5732987072
Llama-3-Lumimaid-8B-v0.1-OAS-Q5_K_S-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:85df1a759098ea6d9355fa17a561b7230ad35f465d40fd9c6dc10c38526831ca
3
+ size 5599293632
Llama-3-Lumimaid-8B-v0.1-OAS-Q6_K-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:077a9777880d94aaa1e1cdd7c699eac5fc72f6372c45d218738cf932e287146c
3
+ size 6596006080
Llama-3-Lumimaid-8B-v0.1-OAS-Q8_0-imat.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9306bb3e51905a01a5960c646126a55801f929dad0faebd48876179d6c4d7b9
3
+ size 8540770496
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - roleplay
5
+ - llama3
6
+ - sillytavern
7
+ ---
8
+
9
+ > [!TIP]
10
+ > **Support:** <br>
11
+ > My upload speeds have been cooked and unstable lately. <br>
12
+ > Realistically I'd need to move to get a better provider. <br>
13
+ > If you **want** and you are able to... <br>
14
+ > You can [**support my various endeavors here (Ko-fi)**](https://ko-fi.com/Lewdiculous). <br>
15
+ > I apologize for disrupting your experience.
16
+
17
+ GGUF-IQ-Imatrix quants for [NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS).
18
+
19
+ **Author:** <br>
20
+ "This model received the Orthogonal Activation Steering treatment, **meaning it will rarely refuse any request.**"
21
+
22
+ > [!IMPORTANT]
23
+ > **Relevant:** <br>
24
+ > These quants have been done after the fixes from [**llama.cpp/pull/6920**](https://github.com/ggerganov/llama.cpp/pull/6920) have been merged. <br>
25
+ > Use **KoboldCpp** version **1.64** or higher, make sure you're up-to-date.
26
+
27
+ > [!WARNING]
28
+ > Compatible SillyTavern presets [here (simple)](https://huggingface.co/ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B/tree/main/Official%20Poppy%20Porpoise%20ST%20Presets)) or [here (Virt's Roleplay Presets - recommended)](https://huggingface.co/Virt-io/SillyTavern-Presets). <br>
29
+ > Use the latest version of KoboldCpp. **Use the provided presets for testing.** <br>
30
+ > Feedback and support for the Authors is always welcome. <br>
31
+ > If there are any issues or questions let me know.
32
+
33
+ > [!NOTE]
34
+ > For **8GB VRAM** GPUs, I recommend the **Q4_K_M-imat** quant for up to 12288 context sizes.
35
+
36
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65d4cf2693a0a3744a27536c/JUxfdTot7v7LTdIGYyzYM.png)
37
+
38
+ **Original model information:**
39
+
40
+ ## Lumimaid 0.1
41
+
42
+ <center><div style="width: 100%;">
43
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/630dfb008df86f1e5becadc3/d3QMaxy3peFTpSlWdWF-k.png" style="display: block; margin: auto;">
44
+ </div></center>
45
+
46
+ This model uses the Llama3 **prompting format**
47
+
48
+ Llama3 trained on our RP datasets, we tried to have a balance between the ERP and the RP, not too horny, but just enough.
49
+
50
+ We also added some non-RP dataset, making the model less dumb overall. It should look like a 40%/60% ratio for Non-RP/RP+ERP data.
51
+
52
+ This model includes the new Luminae dataset from Ikari.
53
+
54
+ This model have received the Orthogonal Activation Steering treatment, meaning it will rarely refuse any request.
55
+
56
+ If you consider trying this model please give us some feedback either on the Community tab on hf or on our [Discord Server](https://discord.gg/MtCVRWTZXY).
57
+
58
+ ## Credits:
59
+ - Undi
60
+ - IkariDev
61
+
62
+ ## Description
63
+
64
+ This repo contains FP16 files of Lumimaid-8B-v0.1-OAS.
65
+
66
+ Switch: [8B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1) - [70B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1) - [70B-alt](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-alt) - [8B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS) - [70B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-OAS)
67
+
68
+ ## Training data used:
69
+ - [Aesir datasets](https://huggingface.co/MinervaAI)
70
+ - [NoRobots](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
71
+ - [limarp](https://huggingface.co/datasets/lemonilia/LimaRP) - 8k ctx
72
+ - [toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
73
+ - [ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
74
+ - Luminae-i1 (70B/70B-alt) (i2 was not existing when the 70b started training) | Luminae-i2 (8B) (this one gave better results on the 8b) - Ikari's Dataset
75
+ - [Squish42/bluemoon-fandom-1-1-rp-cleaned](https://huggingface.co/datasets/Squish42/bluemoon-fandom-1-1-rp-cleaned) - 50% (randomly)
76
+ - [NobodyExistsOnTheInternet/PIPPAsharegptv2test](https://huggingface.co/datasets/NobodyExistsOnTheInternet/PIPPAsharegptv2test) - 5% (randomly)
77
+ - [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned) - 5% (randomly)
78
+ - Airoboros (reduced)
79
+ - [Capybara](https://huggingface.co/datasets/Undi95/Capybara-ShareGPT/) (reduced)
80
+
81
+
82
+ ## Models used (only for 8B)
83
+
84
+ - Initial LumiMaid 8B Finetune
85
+ - Undi95/Llama-3-Unholy-8B-e4
86
+ - Undi95/Llama-3-LewdPlay-8B
87
+
88
+ ## Prompt template: Llama3
89
+
90
+ ```
91
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
92
+
93
+ {system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
94
+
95
+ {input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
96
+
97
+ {output}<|eot_id|>
98
+ ```
99
+
100
+ ## Others
101
+
102
+ Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).
103
+
104
+ IkariDev: Visit my [retro/neocities style website](https://ikaridevgit.github.io/) please kek
imatrix-with-rp-ex.txt ADDED
The diff for this file is too large to render. See raw diff
 
imatrix.dat ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bbdaf55a2765eacbf63f9e2a0358f0d97aafe48f7f1228b2c1489f92a4666e23
3
+ size 4988193