morriszms commited on
Commit
bb4d549
1 Parent(s): 66b59bd

Upload folder using huggingface_hub

Browse files
Phi-3-mini-4k-instruct-Q2_K.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4378f0432a9ab97caf8d17d5871f6978e13092187948b2e1f4c8bbe62c4ab127
3
- size 1416203712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9b82044047ddec6ab4137d06651361840a7f1008a0eae8eea597e27759fbadec
3
+ size 1446880320
Phi-3-mini-4k-instruct-Q3_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b4c2bc21a9e0410bb3d603bcc4590da8688f636f2fec3a6804796b06c02c8126
3
- size 2087596992
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:44808ba99c26ca5c89ee29d1ff1c294675d06c07b9cebda5d78841cd6830288c
3
+ size 2045135424
Phi-3-mini-4k-instruct-Q3_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4b64504667704b0b4de24ab606bfe0d7a3eddb8d6035da1083d6464a0ebaf81b
3
- size 1955476416
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:80f28d845dc4c6d0fef784362655f364c7a1ed196d9f858af06ca662e99065a4
3
+ size 1877625408
Phi-3-mini-4k-instruct-Q3_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8b4fe016b9413a211747f05a481d5059e0970f323951f2a889dbc1f4fd6daef6
3
- size 1681798080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32dbfe5c6000c4c6bb4e3bc2f679f37329da4ccdb893948252ff225c28bff9cb
3
+ size 1681803840
Phi-3-mini-4k-instruct-Q4_0.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff17d85e49bd0d295d2592e095e147539d9a95a7f1a43e1f7ab85e8fe83336b1
3
- size 2176176576
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a2cd87ccae8eb2b0836ffd7a7a3bc122ca6a62d0f5cd93dc983c0859f6e1e7b9
3
+ size 2176182336
Phi-3-mini-4k-instruct-Q4_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4fb2acdf5fd6705a0ed184c72bfa613ca4581c60b01ee5a739aee25bd395937f
3
- size 2393231808
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f83f14c7bbfd894a9a7502cfbd9a6759ce8286aa9799924624f529c647a8efe5
3
+ size 2318919744
Phi-3-mini-4k-instruct-Q4_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:17e0666f480c3a4888737a3e44df43abae4261f7210c3dac1b7d154e8ef05fc4
3
- size 2188759488
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e6ac67b3ed7929d3b63c1e00220340c19295c1d278ab11b7289c88fa7b187ec
3
+ size 2193483840
Phi-3-mini-4k-instruct-Q5_0.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:85d74e540c5a280fdbd71be20b34184f93f4bbabb72a5e983abeacb25ff251c4
3
- size 2641473984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce547afba1d0927c083583b851ff27aaaaf9dbab2064ef92485fc6b9fd70fd35
3
+ size 2641479744
Phi-3-mini-4k-instruct-Q5_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:60bbad0c32710c1388ea1a0fd0ba191d4784715cc7c4310bff5d07c3352e89c7
3
- size 2815275456
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea219f963f6eee55169060fc1a54185dd308a7cac14061a4653d7ed9d06a3412
3
+ size 2715011136
Phi-3-mini-4k-instruct-Q5_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d2115c30de0416b017271b79cf53aaf7e4134d683c792f836e27b32735ae76d2
3
- size 2641473984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a692c07003686dfa5bd7a39826217e45ca9db89021762c6ca4d0cdc769115b8d
3
+ size 2641479744
Phi-3-mini-4k-instruct-Q6_K.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3fe20b7b66b5bfde619f4bf0b4d63863eade43a67c50fcb7e62f92ae652e38fb
3
- size 3135852480
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c7e9e8bad768b2e4badcef1ec0d809fa6f81fb84a9353c22e31bfa0d5d4d1ab
3
+ size 3135858240
Phi-3-mini-4k-instruct-Q8_0.gguf CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34e3863005661d5673808022e543c342cff35274036c063651385e1df13fc60a
3
- size 4061221824
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:863a8f851c2108ab1fed787f26deba4eeff4c5fa3e59ff7413363124e9493f35
3
+ size 4061227584
README.md CHANGED
@@ -1,23 +1,16 @@
1
  ---
2
- license: mit
3
- license_link: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/resolve/main/LICENSE
4
  language:
5
  - en
6
- - fr
7
- pipeline_tag: text-generation
8
  tags:
9
- - nlp
10
- - code
 
 
11
  - TensorBlock
12
  - GGUF
13
- inference:
14
- parameters:
15
- temperature: 0
16
- widget:
17
- - messages:
18
- - role: user
19
- content: Can you provide ways to eat combinations of bananas and dragonfruits?
20
- base_model: microsoft/Phi-3-mini-4k-instruct
21
  ---
22
 
23
  <div style="width: auto; margin-left: auto; margin-right: auto">
@@ -31,9 +24,9 @@ base_model: microsoft/Phi-3-mini-4k-instruct
31
  </div>
32
  </div>
33
 
34
- ## microsoft/Phi-3-mini-4k-instruct - GGUF
35
 
36
- This repo contains GGUF format model files for [microsoft/Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct).
37
 
38
  The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
39
 
@@ -51,16 +44,16 @@ The files were quantized using machines provided by [TensorBlock](https://tensor
51
 
52
  | Filename | Quant type | File Size | Description |
53
  | -------- | ---------- | --------- | ----------- |
54
- | [Phi-3-mini-4k-instruct-Q2_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q2_K.gguf) | Q2_K | 1.319 GB | smallest, significant quality loss - not recommended for most purposes |
55
  | [Phi-3-mini-4k-instruct-Q3_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_S.gguf) | Q3_K_S | 1.566 GB | very small, high quality loss |
56
- | [Phi-3-mini-4k-instruct-Q3_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_M.gguf) | Q3_K_M | 1.821 GB | very small, high quality loss |
57
- | [Phi-3-mini-4k-instruct-Q3_K_L.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_L.gguf) | Q3_K_L | 1.944 GB | small, substantial quality loss |
58
  | [Phi-3-mini-4k-instruct-Q4_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_0.gguf) | Q4_0 | 2.027 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
59
- | [Phi-3-mini-4k-instruct-Q4_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_S.gguf) | Q4_K_S | 2.038 GB | small, greater quality loss |
60
- | [Phi-3-mini-4k-instruct-Q4_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_M.gguf) | Q4_K_M | 2.229 GB | medium, balanced quality - recommended |
61
  | [Phi-3-mini-4k-instruct-Q5_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_0.gguf) | Q5_0 | 2.460 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
62
  | [Phi-3-mini-4k-instruct-Q5_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_S.gguf) | Q5_K_S | 2.460 GB | large, low quality loss - recommended |
63
- | [Phi-3-mini-4k-instruct-Q5_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_M.gguf) | Q5_K_M | 2.622 GB | large, very low quality loss - recommended |
64
  | [Phi-3-mini-4k-instruct-Q6_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q6_K.gguf) | Q6_K | 2.920 GB | very large, extremely low quality loss |
65
  | [Phi-3-mini-4k-instruct-Q8_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q8_0.gguf) | Q8_0 | 3.782 GB | very large, extremely low quality loss - not recommended |
66
 
 
1
  ---
 
 
2
  language:
3
  - en
4
+ library_name: transformers
5
+ license: mit
6
  tags:
7
+ - unsloth
8
+ - transformers
9
+ - phi3
10
+ - phi
11
  - TensorBlock
12
  - GGUF
13
+ base_model: unsloth/Phi-3-mini-4k-instruct
 
 
 
 
 
 
 
14
  ---
15
 
16
  <div style="width: auto; margin-left: auto; margin-right: auto">
 
24
  </div>
25
  </div>
26
 
27
+ ## unsloth/Phi-3-mini-4k-instruct - GGUF
28
 
29
+ This repo contains GGUF format model files for [unsloth/Phi-3-mini-4k-instruct](https://huggingface.co/unsloth/Phi-3-mini-4k-instruct).
30
 
31
  The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
32
 
 
44
 
45
  | Filename | Quant type | File Size | Description |
46
  | -------- | ---------- | --------- | ----------- |
47
+ | [Phi-3-mini-4k-instruct-Q2_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q2_K.gguf) | Q2_K | 1.348 GB | smallest, significant quality loss - not recommended for most purposes |
48
  | [Phi-3-mini-4k-instruct-Q3_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_S.gguf) | Q3_K_S | 1.566 GB | very small, high quality loss |
49
+ | [Phi-3-mini-4k-instruct-Q3_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_M.gguf) | Q3_K_M | 1.749 GB | very small, high quality loss |
50
+ | [Phi-3-mini-4k-instruct-Q3_K_L.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_L.gguf) | Q3_K_L | 1.905 GB | small, substantial quality loss |
51
  | [Phi-3-mini-4k-instruct-Q4_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_0.gguf) | Q4_0 | 2.027 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
52
+ | [Phi-3-mini-4k-instruct-Q4_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_S.gguf) | Q4_K_S | 2.043 GB | small, greater quality loss |
53
+ | [Phi-3-mini-4k-instruct-Q4_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_M.gguf) | Q4_K_M | 2.160 GB | medium, balanced quality - recommended |
54
  | [Phi-3-mini-4k-instruct-Q5_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_0.gguf) | Q5_0 | 2.460 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
55
  | [Phi-3-mini-4k-instruct-Q5_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_S.gguf) | Q5_K_S | 2.460 GB | large, low quality loss - recommended |
56
+ | [Phi-3-mini-4k-instruct-Q5_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_M.gguf) | Q5_K_M | 2.529 GB | large, very low quality loss - recommended |
57
  | [Phi-3-mini-4k-instruct-Q6_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q6_K.gguf) | Q6_K | 2.920 GB | very large, extremely low quality loss |
58
  | [Phi-3-mini-4k-instruct-Q8_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q8_0.gguf) | Q8_0 | 3.782 GB | very large, extremely low quality loss - not recommended |
59