bartowski committed on
Commit 6659e13
1 Parent(s): 7634f1d

Llamacpp quants

README.md CHANGED
@@ -13,7 +13,7 @@ quantized_by: bartowski
 
 ## Llamacpp imatrix Quantizations of gemma-2-27b-it
 
-Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b3266">b3266</a> for quantization.
+Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b3277">b3277</a> for quantization.
 
 Original model: https://huggingface.co/google/gemma-2-27b-it
 
@@ -22,9 +22,11 @@ All quants made using imatrix option with dataset from [here](https://gist.githu
 ## Prompt format
 
 ```
-<start_of_turn>user
+<bos><start_of_turn>user
 {prompt}<end_of_turn>
 <start_of_turn>model
+<end_of_turn>
+<start_of_turn>model
 
 ```
 
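The updated prompt format can be assembled programmatically. The following is a minimal sketch, not part of the commit: the function name `build_gemma_prompt` and the `add_bos` parameter are hypothetical, and it assumes a single user turn that ends at the first `<start_of_turn>model` line, leaving the reply for the model to generate. Some runtimes may prepend `<bos>` themselves, in which case `add_bos=False` would avoid a duplicate token.

```python
def build_gemma_prompt(user_message: str, add_bos: bool = True) -> str:
    """Format one user turn following the README's updated prompt template.

    Hypothetical helper: `add_bos` exists only so callers can skip the
    leading <bos> when their runtime already inserts it (an assumption
    about the runtime, not something this commit states).
    """
    bos = "<bos>" if add_bos else ""
    return (
        f"{bos}<start_of_turn>user\n"
        f"{user_message}<end_of_turn>\n"
        "<start_of_turn>model\n"
    )


if __name__ == "__main__":
    # Print the formatted prompt for a sample question.
    print(build_gemma_prompt("Why is the sky blue?"))
```

The trailing `<end_of_turn>` and second `<start_of_turn>model` lines in the README template are left out here, since this sketch stops where generation would begin.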
 
gemma-2-27b-it-IQ2_M.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b57a62bef93780a1f4443284c2560626e8cce03e8dc1093dc79e9a47ea8f0039
-size 9398878656
+oid sha256:e52e33e837223b71ca4bdb6cb0524f057fa7a750b429235a02796eca5e6e346f
+size 9398878688
gemma-2-27b-it-IQ2_S.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eca060b0336b11a1d68416fc1c16e81af89a35e3fd88e3461cced97983cc1620
-size 8652161472
+oid sha256:700979bdac9262611d9b4f28a7beb94f54239532c4f326acfbc0565d470ebd64
+size 8652161504
gemma-2-27b-it-IQ2_XS.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:80bc7e22d276aa2ecf418e7d99d0410e276b79dc9f75f73e08374c6d14cde817
-size 8399716800
+oid sha256:f3354c9e39ba771af353c1406738317a8e6f38efda9eaf808a6cc6ab578b19e2
+size 8399716832
gemma-2-27b-it-IQ3_M.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:62569c50d8b7b08c72f14e9f0ff6f0c688ec6f021a531a770d7e49f95ae6df0f
-size 12454830528
+oid sha256:ed89ef6c931e93fd8881198dd7037a00611fcca54232f20a3c21bf53396bb317
+size 12454830560
gemma-2-27b-it-IQ3_XS.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d33750cc8acae03b6849aff9478c703f62e4f140ba4c000b43fc04994ad62a93
-size 11550630336
+oid sha256:37a3c5ada8c0b12bb9f503c02e3b550999e03c806c514e5c52192aa1545e39a9
+size 11550630368
gemma-2-27b-it-IQ3_XXS.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:382d5cd0ec92b8c0e10b27b3f63f761c7df894ef29e47272ada87a0bcf09d28d
-size 10750755264
+oid sha256:58e86eb782847d588e53cd17ef1ae38993e2efd7bfce898316ef1dc445dc0639
+size 10750755296
gemma-2-27b-it-IQ4_XS.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed2f0348cd1b2d445aab7b09e4d1d0810cde53b0ea9c323014c554c2f2ecf376
-size 14814421440
+oid sha256:49a2905ca3ae2df5e89f99849250ba24bd85a28924e53f197ff87b99de825c72
+size 14814421472
gemma-2-27b-it-Q2_K.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1196081ee282bed6de7af9c399bf976d263032f1ee871032f6ea41e737d88eb8
-size 10449576384
+oid sha256:505284e7c5c6773907c79be25fee4d3ee63ff8e2c7e9d2117dae35980bd5de15
+size 10449576416
gemma-2-27b-it-Q2_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ca3c39bd3ffb31a177bfe4d3a5e8eb7ef00d5b53c140a77e9f540105e9d4ab0a
-size 11841192384
+oid sha256:dfcd6f5351bf5f792dc36734ad71621240641a5c717181197c81f68ca4b0631a
+size 11841192416
gemma-2-27b-it-Q3_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:668d77919a7b1ffe1dea3cf581a99e1a4946766e634faf854095ebcdd7f1f023
-size 14519361984
+oid sha256:693b4326dab57dc3bcbbfb0dfa9b9c8acfc8bb3d48f10fcbe62b89cf187ab124
+size 14519362016
gemma-2-27b-it-Q3_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d84e63e1431f8bf0005d58e19dbf9931d89a8ca2250f12cd369f20485e80e2d2
-size 13424648640
+oid sha256:3bf06416abbbd7c32ff5ce96b06ed299ae0d120089ef41e1ae1576df2076dc2d
+size 13424648672
gemma-2-27b-it-Q3_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:59b493539449b01fc2bdd6624267e4c119a02f1ff95e576ffdc28b24c019f96e
-size 12169060800
+oid sha256:ebba0b12f4e364c0b0293afa8b4e35ffc6684773ff39773aee8ee7bdfba42ece
+size 12169060832
gemma-2-27b-it-Q3_K_XL.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c0bc9462a24c7fb5dbec57dd4d30d70083da04667ef287303b48d22bb62b579a
-size 15910977984
+oid sha256:cf09bbc9d486430f747e0e616f659d80da019feea724c9840fe971a71a101f0e
+size 15910978016
gemma-2-27b-it-Q4_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d3ffb19246c88d567e34a58d8e38ad828dac8d81faebc990f01c97d0ad0a5d0
-size 18036998592
+oid sha256:cb06a4e5fe15b11c386339463389a8757ac2f88686c5fb2bb6cbb7c3cf6a8ce5
+size 18036998624
gemma-2-27b-it-Q4_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ca86fbdb791842cf2e5eb276a6916e326b3b5d58d9ab60ee3e18b1c6f01fc181
-size 16645382592
+oid sha256:69a0cdba2bc2e56d8298a9330f2a050ebecd657ed315beb3e51ae427b224dbc7
+size 16645382624
gemma-2-27b-it-Q4_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0fd751dda128342f4296cb8388484f72e47de990cb8716c5f4b65589647a5f86
-size 15739265472
+oid sha256:2874871cb446f28254a044da59808d5edc8de45324dd9e85a361dce9e3b48590
+size 15739265504
gemma-2-27b-it-Q5_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:caa11566a6f2e91988308de687ab960991b76ba155e08a007e9051b134577913
-size 20799734208
+oid sha256:5d6ada920573b4fbcb1010ca091012d6324ccb9b449762a8c574bb7a5694686c
+size 20799734240
gemma-2-27b-it-Q5_K_M.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9048d3a9c292287d193af3f98a2898fc75b9168dea5680ba838fc048e3903015
-size 19408118208
+oid sha256:830e5a25cd40b960e9bf5eaa2c9199f636647400b9760b65c6fd1305b11c3b9f
+size 19408118240
gemma-2-27b-it-Q5_K_S.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6a04746410e8492f06ea2d1ab6ec0f30d5cbaf56027a1252191c2b9c2278e41d
-size 18884207040
+oid sha256:0915ac3b3523694968e5a40d55d4cd78b994750fa352153278065c6b36ce8a99
+size 18884207072
gemma-2-27b-it-Q6_K.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:33bdd099ee057c8097fa9a8a0e2d23807afea622b220376dc7b2aab8ea287917
-size 22343524800
+oid sha256:8516647ad9cbec615ea044a29107cb973136fb09d0dca6fd347c398949751005
+size 22343524832
gemma-2-27b-it-Q6_K_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ab32da8eed77f2368f76887b19d7fc87ae77247005d9ce02702979db8270da54
-size 23735140800
+oid sha256:325e5a64dfc7abc0c3f0787c5bbb6dc653a69f108911624777943d96c915d808
+size 23735140832
gemma-2-27b-it-Q8_0.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:76f73556fa30bdfa510d3dacc5245d9488ca83e264c354512167a884a66746cd
-size 28937388480
+oid sha256:b0f19c0528d5e1d9bb6a38a25b0f49131409d7c30aba0d79c5b5aeed1ea6cf77
+size 28937388512
gemma-2-27b-it-Q8_0_L.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d27e8b1954459960dcd4770644486dcfe815edc6f4b853427d3be7446023bf58
-size 30043308480
+oid sha256:6b952b72f433fa83b48e391a1a2a52a8fdda917359751a04482f234c40326490
+size 30043308512
gemma-2-27b-it-f32.gguf/gemma-2-27b-it-f32-00001-of-00003.gguf CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3de30298e3307ec26163cb5bf63c25c42e5924fd7167bbecbb7102408e5cd14c
-size 39605588480
+oid sha256:330698e639be75dfd5e3f4ad08652afc560cd899bffa2ec163c1655259a1222e
+size 39605588544
gemma-2-27b-it.imatrix CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:de3960ddb965f3fbce726bac948f6f0bcf1e37a320b341fc19e383c6712bcf8b
+oid sha256:7ba2252f6db47e345471154af6022b614d280f1a3498ed90bd637ef1b28d3f3f
 size 11786697
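
Each changed file above is a Git LFS pointer recording a SHA-256 `oid` and a byte `size`, so a downloaded file can be checked against the new pointer. The sketch below is illustrative only and is not tooling from this repository: the helper names `parse_lfs_pointer` and `verify_file` are hypothetical, and it assumes the pointer text has already been reduced to plain `key value` lines with no diff markers.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and digest both match the pointer."""
    # Cheap check first: a size mismatch avoids hashing a multi-GB file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so large files are never loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    # New pointer for gemma-2-27b-it.imatrix, copied from the diff above.
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:7ba2252f6db47e345471154af6022b614d280f1a3498ed90bd637ef1b28d3f3f\n"
        "size 11786697\n"
    )
    print(pointer["algo"], pointer["size"])
```

Calling `verify_file("gemma-2-27b-it.imatrix", pointer)` on a local copy would then report whether the download matches this commit; the local path is an assumption about where the file was saved.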