replit-code-v1_5-3b-GGUF

Runtime error

limcheekin commited on Nov 26, 2023

Commit

3cfa708

•

1 Parent(s): 2a40a34

feat: updated model download url and n_ctx param

Files changed (3) hide show

Dockerfile CHANGED Viewed

@@ -15,7 +15,7 @@ RUN pip install -U pip setuptools wheel && \
 # Download model
 RUN mkdir model && \
-    curl -L https://huggingface.co/lynnleelhl/replit-code-v1_5-3b-gguf/resolve/main/ggml-model-f16.gguf -o model/gguf-model.bin
 COPY ./start_server.sh ./
 COPY ./main.py ./

 # Download model
 RUN mkdir model && \
+    curl -L https://huggingface.co/abetlen/replit-code-v1_5-3b-GGUF/resolve/main/replit-code-v1_5-3b.f16.gguf -o model/gguf-model.bin
 COPY ./start_server.sh ./
 COPY ./main.py ./

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ colorTo: blue
 sdk: docker
 models:
   - replit/replit-code-v1_5-3b
-  - lynnleelhl/replit-code-v1_5-3b-gguf
 tags:
   - inference api
   - openai-api compatible

 sdk: docker
 models:
   - replit/replit-code-v1_5-3b
+  - abetlen/replit-code-v1_5-3b-GGUF
 tags:
   - inference api
   - openai-api compatible

main.py CHANGED Viewed

@@ -6,7 +6,8 @@ app = create_app(
     Settings(
         n_threads=2,  # set to number of cpu cores
         model="model/gguf-model.bin",
-        embedding=True
     )
 )

     Settings(
         n_threads=2,  # set to number of cpu cores
         model="model/gguf-model.bin",
+        embedding=True,
+        n_ctx=16192  # For GitHub Copilot
     )
 )