phate334 commited on
Commit
080da2e
1 Parent(s): dce4512

add Docker example

Browse files
Files changed (1) hide show
  1. README.md +29 -1
README.md CHANGED
@@ -10,4 +10,32 @@ tags:
10
  ---
11
 
12
  # phate334/multilingual-e5-large-gguf
13
- This model was converted to GGUF format from [`intfloat/multilingual-e5-large`](https://huggingface.co/intfloat/multilingual-e5-large) using llama.cpp.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  # phate334/multilingual-e5-large-gguf
13
+ This model was converted to GGUF format from [`intfloat/multilingual-e5-large`](https://huggingface.co/intfloat/multilingual-e5-large) using llama.cpp.
14
+
15
+ ## Run it
16
+
17
+ - Deploy using Docker
18
+
19
+ ```bash
20
+ $ docker run -p 8080:8080 -v ./multilingual-e5-large-q4_k_m.gguf:/multilingual-e5-large-q4_k_m.gguf ghcr.io/ggerganov/llama.cpp:server--b1-4b9afbb --host 0.0.0.0 --embedding -m /multilingual-e5-large-q4_k_m.gguf
21
+ ```
22
+
23
+ or Docker Compose
24
+
25
+ ```yaml
26
+ services:
27
+ e5-f16:
28
+ image: ghcr.io/ggerganov/llama.cpp:server--b1-4b9afbb
29
+ ports:
30
+ - 8080:8080
31
+ volumes:
32
+ - ./multilingual-e5-large-f16.gguf:/multilingual-e5-large-f16.gguf
33
+ command: --host 0.0.0.0 --embedding -m /multilingual-e5-large-f16.gguf
34
+ e5-q4:
35
+ image: ghcr.io/ggerganov/llama.cpp:server--b1-4b9afbb
36
+ ports:
37
+ - 8081:8080
38
+ volumes:
39
+ - ./multilingual-e5-large-q4_k_m.gguf:/multilingual-e5-large-q4_k_m.gguf
40
+ command: --host 0.0.0.0 --embedding -m /multilingual-e5-large-q4_k_m.gguf
41
+ ```