Add Q5_K_M model

Signed-off-by: Xin Liu <[email protected]>

Files changed (3)

.gitattributes +1 -0
Meta-Llama-3-8B-Instruct-Q5_K_M.gguf +3 -0
README.md +67 -1

.gitattributes CHANGED

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text

Meta-Llama-3-8B-Instruct-Q5_K_M.gguf ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2a50fd6d990f523bcec82460b7f78cc4ac1cb927c004c04e7f843bcf9f21a260
+size 5732987072
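
The three added lines above are a Git LFS pointer, not the model itself: they record the SHA-256 digest and byte size of the real `.gguf` file. As a minimal sketch (the function names `parse_lfs_pointer` and `verify_against_pointer` are hypothetical helpers, not part of Git LFS or LlamaEdge), a downloaded copy could be checked against the pointer like this:

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def verify_against_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if os.path.getsize(path) != int(fields["size"]):
        return False
    # Hash in chunks so a multi-gigabyte model is never loaded into memory at once.
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest


# Pointer contents copied from the diff above, with the leading '+' markers removed.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:2a50fd6d990f523bcec82460b7f78cc4ac1cb927c004c04e7f843bcf9f21a260
size 5732987072
"""
```

Assuming the model was downloaded to the working directory, `verify_against_pointer("Meta-Llama-3-8B-Instruct-Q5_K_M.gguf", POINTER)` should return `True` for an intact file.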

README.md CHANGED

@@ -1,3 +1,69 @@
 ---
-license: apache-2.0
+language:
+- en
+license: other
+license_name: llama3
+model_name: Llama3 8B Instruct
+arxiv: 2307.09288
+base_model: meta-llama/Meta-Llama-3-8B-Instruct
+inference: false
+model_creator: Meta Llama3
+model_type: llama
+pipeline_tag: text-generation
+quantized_by: Second State Inc.
 ---
+<!-- header start -->
+<!-- 200823 -->
+<div style="width: auto; margin-left: auto; margin-right: auto">
+<img src="https://github.com/LlamaEdge/LlamaEdge/raw/dev/assets/logo.svg" style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<hr style="margin-top: 1.0em; margin-bottom: 1.0em;">
+<!-- header end -->
+# Llama-3-8B-Instruct-GGUF
+## Original Model
+[meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
+## Run with LlamaEdge
+- LlamaEdge version: coming soon
+- Prompt template
+  - Prompt type: `llama-3-chat`
+  - Prompt string
+    ```text
+    <|begin_of_text|><|start_header_id|>system<|end_header_id|>
+    {{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+    {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+    {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
+    {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+    ```
+- Context size: `4096`
+- Run as LlamaEdge service
+  ```bash
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Meta-Llama-3-8B-Instruct-Q5_K_M.gguf \
+    llama-api-server.wasm \
+    --prompt-template llama-3-chat \
+    --context-size 4096 \
+    --model-name Llama-3-8b
+  ```
+<!--
+- Run as LlamaEdge command app
+  ```bash
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-hf-Q5_K_M.gguf llama-chat.wasm -p llama-2-chat
+  ``` -->
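
The `llama-3-chat` prompt string added to the README can be read as a simple rule: one `<|begin_of_text|>` token, then a header plus content plus `<|eot_id|>` for each turn, ending with an open assistant header for the model to complete. The sketch below illustrates that rule only; `render_llama3_chat` is a hypothetical helper, not a LlamaEdge API, and the single newline after each header is an assumption copied from the template as displayed in the diff.

```python
def render_llama3_chat(system_prompt, turns):
    """Render a prompt following the llama-3-chat template shown in the README diff.

    `turns` is a list of (role, content) pairs with role "user" or "assistant",
    normally ending with a user message that the model should answer.
    """
    def header(role):
        # Assumption: one newline after the header, as in the displayed template.
        return f"<|start_header_id|>{role}<|end_header_id|>\n"

    parts = ["<|begin_of_text|>", header("system"), system_prompt, "<|eot_id|>"]
    for role, content in turns:
        if role not in ("user", "assistant"):
            raise ValueError(f"unexpected role: {role!r}")
        parts += [header(role), content, "<|eot_id|>"]
    # Leave an open assistant header so generation continues from here.
    parts.append(header("assistant"))
    return "".join(parts)


if __name__ == "__main__":
    print(render_llama3_chat(
        "You are a helpful assistant.",
        [("user", "Hi"), ("assistant", "Hello!"), ("user", "What is GGUF?")],
    ))
```

When running the API server, passing `--prompt-template llama-3-chat` makes this formatting the server's job, so a helper like this is mainly useful for inspecting or debugging what the model receives.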