morriszms committed on
Commit
a00eb6c
1 Parent(s): 180567a

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+ phi-2-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,76 @@
+ ---
+ license: mit
+ license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE
+ language:
+ - en
+ pipeline_tag: text-generation
+ tags:
+ - nlp
+ - code
+ - TensorBlock
+ - GGUF
+ base_model: microsoft/phi-2
+ ---
+
+ <div style="width: auto; margin-left: auto; margin-right: auto">
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+ </div>
+ <div style="display: flex; justify-content: space-between; width: 100%;">
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
+ </p>
+ </div>
+ </div>
+
+ ## microsoft/phi-2 - GGUF
+
+ This repo contains GGUF format model files for [microsoft/phi-2](https://huggingface.co/microsoft/phi-2).
+
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
+
+ ## Prompt template
+
+ ```
+
+ ```
+
+ ## Model file specification
+
+ | Filename | Quant type | File Size | Description |
+ | -------- | ---------- | --------- | ----------- |
+ | [phi-2-Q2_K.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q2_K.gguf) | Q2_K | 1.034 GB | smallest, significant quality loss - not recommended for most purposes |
+ | [phi-2-Q3_K_S.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q3_K_S.gguf) | Q3_K_S | 1.165 GB | very small, high quality loss |
+ | [phi-2-Q3_K_M.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q3_K_M.gguf) | Q3_K_M | 1.328 GB | very small, high quality loss |
+ | [phi-2-Q3_K_L.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q3_K_L.gguf) | Q3_K_L | 1.467 GB | small, substantial quality loss |
+ | [phi-2-Q4_0.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q4_0.gguf) | Q4_0 | 1.492 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
+ | [phi-2-Q4_K_S.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q4_K_S.gguf) | Q4_K_S | 1.508 GB | small, greater quality loss |
+ | [phi-2-Q4_K_M.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q4_K_M.gguf) | Q4_K_M | 1.618 GB | medium, balanced quality - recommended |
+ | [phi-2-Q5_0.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q5_0.gguf) | Q5_0 | 1.801 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
+ | [phi-2-Q5_K_S.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q5_K_S.gguf) | Q5_K_S | 1.801 GB | large, low quality loss - recommended |
+ | [phi-2-Q5_K_M.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q5_K_M.gguf) | Q5_K_M | 1.865 GB | large, very low quality loss - recommended |
+ | [phi-2-Q6_K.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q6_K.gguf) | Q6_K | 2.128 GB | very large, extremely low quality loss |
+ | [phi-2-Q8_0.gguf](https://huggingface.co/tensorblock/phi-2-GGUF/tree/main/phi-2-Q8_0.gguf) | Q8_0 | 2.755 GB | very large, extremely low quality loss - not recommended |
+
+
+ ## Download instructions
+
+ ### Command line
+
+ First, install the Hugging Face CLI client:
+
+ ```shell
+ pip install -U "huggingface_hub[cli]"
+ ```
+
+ Then, download an individual model file to a local directory:
+
+ ```shell
+ huggingface-cli download tensorblock/phi-2-GGUF --include "phi-2-Q2_K.gguf" --local-dir MY_LOCAL_DIR
+ ```
+
+ To download multiple model files that match a pattern (e.g., `*Q4_K*gguf`), run:
+
+ ```shell
+ huggingface-cli download tensorblock/phi-2-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
+ ```
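The pattern-based download in the README can also be driven from Python. The following is a minimal sketch, not part of the commit: it assumes that `--include` patterns behave like shell-style globs as implemented by Python's `fnmatch`, and the helper names `matching_files` and `download` are hypothetical. The file list is copied from the README table, so the pattern can be previewed offline before anything is fetched.

```python
from fnmatch import fnmatchcase

REPO_ID = "tensorblock/phi-2-GGUF"

# File names copied from the README's model file table.
FILES = [
    "phi-2-Q2_K.gguf", "phi-2-Q3_K_S.gguf", "phi-2-Q3_K_M.gguf",
    "phi-2-Q3_K_L.gguf", "phi-2-Q4_0.gguf", "phi-2-Q4_K_S.gguf",
    "phi-2-Q4_K_M.gguf", "phi-2-Q5_0.gguf", "phi-2-Q5_K_S.gguf",
    "phi-2-Q5_K_M.gguf", "phi-2-Q6_K.gguf", "phi-2-Q8_0.gguf",
]


def matching_files(pattern, filenames=FILES):
    """Return the names that a shell-style pattern would select (offline preview)."""
    return [name for name in filenames if fnmatchcase(name, pattern)]


def download(pattern, local_dir="MY_LOCAL_DIR"):
    """Fetch the matching files; needs network access and huggingface_hub installed."""
    # Imported lazily so the offline preview works without the dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[pattern],
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Preview only; call download("*Q4_K*gguf") to actually fetch the files.
    print(matching_files("*Q4_K*gguf"))
```

Previewing the match first is a cheap way to avoid pulling several gigabytes by accident when a pattern is broader than intended.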
phi-2-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e08d782b8cba58be242d78cca8a6c69effc10a5da7b3d858dbc15e32b77c0b88
+ size 1109720128
phi-2-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c85410d6327c53ce696befe7804e2474cea01f57cf0a13a5263640d26de4508a
+ size 1575230528
phi-2-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4e6d79af8b4d7eddbe2d55cdda3f60639aea5bc80cd59f5df114dd58a8285ffa
+ size 1426136128
phi-2-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:43e97b3b844bcc65eb49ebecf640317eb626f4ae905ac9b86d56d42f15863ab5
+ size 1250827328
phi-2-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7654105c20b53439b190d6fa1d511ceefb0594384cc25e0057e451e89ede9f07
+ size 1602468928
phi-2-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8c689e657c3f0351ff73da7bb053b9ab004e3100676d61d24de7e740c5078a7a
+ size 1737636928
phi-2-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:22497949c7e04bc3055d7d48d224424358d71d007a2860004e00e4a77049a582
+ size 1618852928
phi-2-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cafb945cfc51577be3d6d9d3c86e08a5ccaaa4b6e7b7b77abbef3e6c6a6396c6
+ size 1933425728
phi-2-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3959e50cbda13da92855c6f8f81b65f03754f2de24a19281231b03cd04cd55c8
+ size 2003057728
phi-2-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:93397c5f1a14bd4b11a3afdccdc7539d6947d45ce997cc222ac132ee09cd7498
+ size 1933425728
phi-2-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1649b9bd7d5965c15e858039756b7398dffa3f52297a1ab4c8f4d997924fa166
+ size 2285067328
phi-2-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bc29d0e0c95c28a3baaeb25459fb845ae6ab265e3c2ea78329434aba4250d125
+ size 2958040128
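Each `.gguf` entry in this commit is a Git LFS pointer (spec version, SHA-256 object id, byte size), not the model data itself. As a rough sketch of how a pointer can be used to check a downloaded file, the snippet below parses a pointer and compares a local file's size and hash against it. The function names are hypothetical and not part of the commit. The size helper also shows that the 1.034 GB listed for `phi-2-Q2_K.gguf` in the README table corresponds to the pointer's byte count divided by 2^30.

```python
import hashlib

GIB = 1024 ** 3

# Pointer contents for phi-2-Q2_K.gguf, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:e08d782b8cba58be242d78cca8a6c69effc10a5da7b3d858dbc15e32b77c0b88
size 1109720128
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def size_in_gib(num_bytes):
    """Convert a byte count to GiB, rounded to three decimals."""
    return round(num_bytes / GIB, 3)


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Hash a local file in chunks and compare size and SHA-256 to the pointer."""
    hasher = hashlib.sha256()
    total = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER)
    print(pointer["algo"], pointer["size"], size_in_gib(pointer["size"]))
```

After a download, `matches_pointer("MY_LOCAL_DIR/phi-2-Q2_K.gguf", parse_lfs_pointer(POINTER))` would be expected to return `True` for an intact file; reading in chunks keeps memory use flat for multi-gigabyte models.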