Update README.md
README.md (CHANGED)
````diff
@@ -101,6 +101,34 @@ French-Alpaca-7B-Instruct_beta 5.587866
 vigogne-2-7b-chat 4.218750
 ```
 
+### Quantized versions
+
+* **4-bit quantized version** is available here : [jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q4_K_M-GGUF](https://huggingface.co/jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q4_K_M-GGUF)
+
+* **8-bit quantized version** also available here : [jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q8_0-GGUF](https://huggingface.co/jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q8_0-GGUF)
+
+* **Ollama**: [jpacifico/chocolatine-3b](https://ollama.com/jpacifico/chocolatine-3b)
+
+```bash
+ollama run jpacifico/chocolatine-3b
+```
+
+Ollama *Modelfile* example :
+
+```bash
+FROM ./chocolatine-3b-instruct-dpo-revised-q4_k_m.gguf
+TEMPLATE """{{ if .System }}<|system|>
+{{ .System }}<|end|>
+{{ end }}{{ if .Prompt }}<|user|>
+{{ .Prompt }}<|end|>
+{{ end }}<|assistant|>
+{{ .Response }}<|end|>
+"""
+PARAMETER stop """{"stop": ["<|end|>","<|user|>","<|assistant|>"]}"""
+SYSTEM """You are a friendly assistant called Chocolatine."""
+```
+
 ### Usage
 
 You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_3B_inference_test_colab.ipynb)
@@ -138,29 +166,6 @@ sequences = pipeline(
 print(sequences[0]['generated_text'])
 ```
 
-* **4-bit quantized version** is available here : [jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q4_K_M-GGUF](https://huggingface.co/jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q4_K_M-GGUF)
-
-* **Ollama**: [jpacifico/chocolatine-3b](https://ollama.com/jpacifico/chocolatine-3b)
-
-```bash
-ollama run jpacifico/chocolatine-3b
-```
-
-Ollama *Modelfile* example :
-
-```bash
-FROM ./chocolatine-3b-instruct-dpo-revised-q4_k_m.gguf
-TEMPLATE """{{ if .System }}<|system|>
-{{ .System }}<|end|>
-{{ end }}{{ if .Prompt }}<|user|>
-{{ .Prompt }}<|end|>
-{{ end }}<|assistant|>
-{{ .Response }}<|end|>
-"""
-PARAMETER stop """{"stop": ["<|end|>","<|user|>","<|assistant|>"]}"""
-SYSTEM """You are a friendly assistant called Chocolatine."""
-```
-
 ### Limitations
 
 The Chocolatine model is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
````