Update README.md
README.md CHANGED
@@ -11,16 +11,21 @@ license: mit
 
 ## Usage
 
+**Windows does not support single executable files larger than 4 GB, so on Windows the llamafile runtime and the GGUF model must be downloaded separately and run together.**
+
+### Windows
+
+* Download [llamafile](https://github.com/Mozilla-Ocho/llamafile/releases/download/0.6.2/llamafile-0.6.2).
+* Rename the file to `llamafile-0.6.2.exe`
+* Download the GGUF model [qwen1_5-14b-chat-q5_k_m.gguf](https://huggingface.co/Qwen/Qwen1.5-14B-Chat-GGUF/resolve/main/qwen1_5-14b-chat-q5_k_m.gguf)
+* Open a terminal window and run: `.\llamafile-0.6.2.exe -m .\qwen1_5-14b-chat-q5_k_m.gguf -ngl 9999 --host 0.0.0.0 --port 8080`
+* Open http://127.0.0.1:8080 in a browser to start chatting
+
+### Linux / macOS
+
 1. Download model: [qwen1.5-14b-chat-q5_k_m.llamafile](https://huggingface.co/bingal/Qwen1.5-14B-Chat-llamafile/resolve/main/qwen1.5-14b-chat-q5_k_m.llamafile)
 2. Run the model
-   * Linux / macOS
-     1. Add execution permissions: `chmod +x ./qwen1.5-14b-chat-q5_k_m.llamafile`
-     2. Run in terminal: `./qwen1.5-14b-chat-q5_k_m.llamafile`
-     3. Open browser to http://127.0.0.1:8080 to start chatting
+   * Add execution permissions: `chmod +x ./qwen1.5-14b-chat-q5_k_m.llamafile`
+   * Run in terminal: `./qwen1.5-14b-chat-q5_k_m.llamafile`
+   * Open http://127.0.0.1:8080 in a browser to start chatting
+
 3. OpenAI API usage
    * API URL: `http://127.0.0.1:8080/v1`
    * Python code:
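The Python snippet that `Python code:` introduces sits below this hunk, so the view cuts off before it. As a rough sketch (not the README's own snippet), assuming the `openai` Python package (v1+) and a server running on the default port, a call against the OpenAI-compatible endpoint could look like this; the model name and API key are placeholders, since the local server hosts a single model and does not check keys:

```python
# Minimal sketch: chat with the local llamafile server through its
# OpenAI-compatible API. Assumes `pip install openai` (v1+) and a server
# started as shown above, listening on http://127.0.0.1:8080.
from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:8080/v1",  # local llamafile endpoint
    api_key="sk-no-key-required",         # placeholder; the local server ignores keys
)

response = client.chat.completions.create(
    model="qwen1.5-14b-chat",  # placeholder name; the server hosts one model
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Introduce yourself in one sentence."},
    ],
)
print(response.choices[0].message.content)
```

The same endpoint also answers plain HTTP, so a `curl` POST to `http://127.0.0.1:8080/v1/chat/completions` with a JSON body works as a quick smoke test.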