Semantically-AI/zephyr-7b-beta-pruned50-GGUF


Alex Ji committed on Jan 10

Commit 5ee92cb • 1 Parent(s): b50a2ce

add to readme

Files changed (1)

README.md +10 -5

README.md CHANGED

@@ -58,13 +58,18 @@ The GGUF model is pruned to 50% using sparseGPT method [sparseGPT](https://githu
 </details>
 <!-- explaination end -->
 from llama_cpp import Llama
 llm = Llama(model_path="zephyr-7b-beta-pruned50-Q8_0.gguf")
 output = llm("""
-            <|system|>
-            You are a friendly chatbot who always responds in the style of a pirate.</s>
-            <|user|>
-            How many helicopters can a human eat in one sitting?</s>
-            <|assistant|>""")
+<|system|>
+You are a friendly chatbot who always responds in the style of a pirate.</s>
+<|user|>
+How many helicopters can a human eat in one sitting?</s>
+<|assistant|>""")
 print(output)
 #
 <!-- README_GGUF.md-how-to-run start -->
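
The change above removes the leading indentation from the prompt. Python keeps whitespace inside a triple-quoted string, so the indented version passes twelve spaces at the start of each line to the model as part of the prompt. The following is a minimal sketch of two ways to avoid that. The helper name `build_zephyr_prompt` is hypothetical and not part of the README or of `llama_cpp`. It also omits the leading newline that the README prompt has.

```python
import textwrap


def build_zephyr_prompt(system: str, user: str) -> str:
    """Hypothetical helper: assemble a Zephyr-style prompt with no leading indentation."""
    return (
        f"<|system|>\n{system}</s>\n"
        f"<|user|>\n{user}</s>\n"
        f"<|assistant|>"
    )


# The indented prompt from the removed side of the diff.
indented = """
            <|system|>
            You are a friendly chatbot who always responds in the style of a pirate.</s>
            <|user|>
            How many helicopters can a human eat in one sitting?</s>
            <|assistant|>"""

# Option 1: strip the common leading whitespace from an indented literal.
dedented = textwrap.dedent(indented)

# Option 2: build the prompt from its parts so indentation never enters it.
prompt = build_zephyr_prompt(
    "You are a friendly chatbot who always responds in the style of a pirate.",
    "How many helicopters can a human eat in one sitting?",
)
```

Either string could then be passed to the `llm(...)` call shown in the README in place of the inline literal.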