Update README.md
Browse files
README.md
CHANGED
@@ -46,10 +46,16 @@ Many thanks to William Beauchamp from [Chai](https://chai-research.com/) for pro
|
|
46 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/upstage-llama-30b-instruct-2048-GGML)
|
47 |
* [Original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/upstage/llama-30b-instruct-2048)
|
48 |
|
49 |
-
## Prompt template:
|
50 |
|
51 |
```
|
|
|
|
|
|
|
|
|
52 |
{prompt}
|
|
|
|
|
53 |
```
|
54 |
|
55 |
<!-- compatibility_ggml start -->
|
@@ -106,7 +112,7 @@ Refer to the Provided Files table below to see what files use which methods, and
|
|
106 |
I use the following command line; adjust for your tastes and needs:
|
107 |
|
108 |
```
|
109 |
-
./main -t 10 -ngl 32 -m upstage-llama-30b-instruct-2048.ggmlv3.q4_0.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "###
|
110 |
```
|
111 |
Change `-t 10` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.
|
112 |
|
|
|
46 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/upstage-llama-30b-instruct-2048-GGML)
|
47 |
* [Original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/upstage/llama-30b-instruct-2048)
|
48 |
|
49 |
+
## Prompt template: Orca-Hashes
|
50 |
|
51 |
```
|
52 |
+
### System:
|
53 |
+
{System}
|
54 |
+
|
55 |
+
### User:
|
56 |
{prompt}
|
57 |
+
|
58 |
+
### Assistant:
|
59 |
```
|
60 |
|
61 |
<!-- compatibility_ggml start -->
|
|
|
112 |
I use the following command line; adjust for your tastes and needs:
|
113 |
|
114 |
```
|
115 |
+
./main -t 10 -ngl 32 -m upstage-llama-30b-instruct-2048.ggmlv3.q4_0.bin --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "### System: You are a helpful assistant\n### User: write a story about llamas\n### Assistant:"
|
116 |
```
|
117 |
Change `-t 10` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.
|
118 |
|