chore: prompt format
README.md
CHANGED
@@ -28,6 +28,17 @@ StripedHyena is a hybrid architecture composed of multi-head, grouped-query atte
 - Improvement to training and inference-optimal scaling laws, compared to optimized Transformer architectures such as Llama-2.
 - Trained on sequences of up to 32k, allowing it to process longer prompts.
 
+### Prompt Format
+
+StripedHyena-Nous 7B uses ChatML as the prompt format:
+
+```
+<|im_start|>user
+Hello!<|im_end|>
+<|im_start|>assistant
+Hi there!<|im_end|>
+```
+
 ### Disclaimer
 
 To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the [standalone repository](https://github.com/togethercomputer/stripedhyena).
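
The ChatML layout added to the README in this change can be assembled with plain string formatting. The following is a minimal sketch, not code from the repository: the helper name `build_chatml_prompt`, the `add_generation_prompt` flag, and the newline placed after each `<|im_end|>` are assumptions made for illustration. Only the `<|im_start|>` and `<|im_end|>` markers and the `user` and `assistant` roles come from the README example.

```python
from typing import Dict, List

# Markers taken from the README's ChatML example.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_chatml_prompt(
    messages: List[Dict[str, str]],
    add_generation_prompt: bool = True,
) -> str:
    """Render {"role", "content"} dicts in the ChatML layout (hypothetical helper)."""
    parts = []
    for message in messages:
        # Each turn: start marker, role, newline, content, end marker.
        # Assumption: turns are separated by a single newline.
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    if add_generation_prompt:
        # Leave an assistant turn open so the model writes the reply.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = build_chatml_prompt([{"role": "user", "content": "Hello!"}])
    print(prompt)
```

If the tokenizer you load for the model provides its own chat template, that is likely the safer route. The sketch above only mirrors the format shown in the README.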