shaowenchen
commited on
Commit
•
219f5fd
1
Parent(s):
95454ca
add images to readme
Browse files
README.md
CHANGED
@@ -40,3 +40,18 @@ tags:
|
|
40 |
| llama-2-13b-langchain-chat.Q6_K.gguf | Q6_K | 9.9 GB |
|
41 |
| llama-2-13b-langchain-chat.Q8_0.gguf | Q8_0 | 13 GB |
|
42 |
| llama-2-13b-langchain-chat.gguf | ful | 24 GB |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
40 |
| llama-2-13b-langchain-chat.Q6_K.gguf | Q6_K | 9.9 GB |
|
41 |
| llama-2-13b-langchain-chat.Q8_0.gguf | Q8_0 | 13 GB |
|
42 |
| llama-2-13b-langchain-chat.gguf | ful | 24 GB |
|
43 |
+
|
44 |
+
## Provided images
|
45 |
+
|
46 |
+
| Name | Quant method | Size |
|
47 |
+
| -------------------------------------------------- | ------------ | ------- |
|
48 |
+
| `shaowenchen/llama-2-13b-langchain-chat-gguf:Q4_K` | Q4_K | 16.7 GB |
|
49 |
+
| `shaowenchen/llama-2-13b-langchain-chat-gguf:Q5_K` | Q5_K | 19.5 GB |
|
50 |
+
|
51 |
+
Usage:
|
52 |
+
|
53 |
+
```
|
54 |
+
docker run --rm -p 8000:8000 shaowenchen/llama-2-13b-langchain-chat-gguf:Q4_K
|
55 |
+
```
|
56 |
+
|
57 |
+
and you can view http://localhost:8000/docs to see the swagger UI.
|