ArchiveAI Pyroserenus committed on
Commit 1ebc0a5
0 Parent(s)

Duplicate from Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF

Co-authored-by: Pyroserenus <[email protected]>

Files changed (4)
  1. .gitattributes +37 -0
  2. README.md +53 -0
  3. imatrix.dat +3 -0
  4. orthrus-12b-v0.8-q6_k.gguf +3 -0
.gitattributes ADDED
@@ -0,0 +1,37 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
+ orthrus-12b-v0.8-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
+ imatrix.dat filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,53 @@
+ ---
+ base_model: Pyroserenus/Orthrus-12b-v0.8
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ - llama-cpp
+ - gguf-my-repo
+ ---
+
+ # Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF
+ This model was converted to GGUF format from [`Pyroserenus/Orthrus-12b-v0.8`](https://huggingface.co/Pyroserenus/Orthrus-12b-v0.8) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
+ Refer to the [original model card](https://huggingface.co/Pyroserenus/Orthrus-12b-v0.8) for more details on the model.
+
+ ## Use with llama.cpp
+ Install llama.cpp through brew (works on Mac and Linux):
+
+ ```bash
+ brew install llama.cpp
+
+ ```
+ Invoke the llama.cpp server or the CLI.
+
+ ### CLI:
+ ```bash
+ llama-cli --hf-repo Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF --hf-file orthrus-12b-v0.8-q6_k.gguf -p "The meaning to life and the universe is"
+ ```
+
+ ### Server:
+ ```bash
+ llama-server --hf-repo Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF --hf-file orthrus-12b-v0.8-q6_k.gguf -c 2048
+ ```
+
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.
+
+ Step 1: Clone llama.cpp from GitHub.
+ ```
+ git clone https://github.com/ggerganov/llama.cpp
+ ```
+
+ Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag along with other hardware-specific flags (for example, `LLAMA_CUDA=1` for Nvidia GPUs on Linux).
+ ```
+ cd llama.cpp && LLAMA_CURL=1 make
+ ```
+
+ Step 3: Run inference through the main binary.
+ ```
+ ./llama-cli --hf-repo Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF --hf-file orthrus-12b-v0.8-q6_k.gguf -p "The meaning to life and the universe is"
+ ```
+ or
+ ```
+ ./llama-server --hf-repo Pyroserenus/Orthrus-12b-v0.8-Q6_K-GGUF --hf-file orthrus-12b-v0.8-q6_k.gguf -c 2048
+ ```
imatrix.dat ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:40841826d7163513ccd52f4f4c980d8e3d36f785a1b2398f3209a982c8027fba
+ size 4536725
orthrus-12b-v0.8-q6_k.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:062b51ef1a712bb75686e1d08a911e6da553b3e94866a3521f99b0824d2a885f
+ size 10056209760
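
Note on the two LFS pointers above: each records the hash algorithm, the hex digest, and the byte size of the real file, so a downloaded copy can be checked against the commit. The following is a minimal sketch of such a check, not part of the commit itself. The function names `parse_lfs_pointer` and `matches_pointer` and the constant `IMATRIX_POINTER` are hypothetical names chosen for illustration, and the local file path is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into (hash algorithm, hex digest, size in bytes)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", e.g. "size 4536725".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return algo, digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest the pointer records."""
    algo, digest, size = parse_lfs_pointer(pointer_text)
    # Cheap check first: a size mismatch means there is no need to hash.
    if os.path.getsize(path) != size:
        return False
    hasher = hashlib.new(algo)
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte GGUF file is never held in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == digest


# Pointer contents copied from the imatrix.dat diff in this commit.
IMATRIX_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:40841826d7163513ccd52f4f4c980d8e3d36f785a1b2398f3209a982c8027fba
size 4536725
"""
```

Assuming `imatrix.dat` has been downloaded into the current directory, `matches_pointer("imatrix.dat", IMATRIX_POINTER)` should return `True` for an intact copy. The same call works for `orthrus-12b-v0.8-q6_k.gguf` with its own pointer text, although hashing roughly 10 GB takes a while.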