emnakamura
commited on
Commit
•
74f5a89
1
Parent(s):
4462537
Upload folder using huggingface_hub
Browse files
README.md
ADDED
@@ -0,0 +1,48 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
library_name: transformers
|
3 |
+
license: apache-2.0
|
4 |
+
base_model:
|
5 |
+
- Mahou-1.3-M1-mistral-7B
|
6 |
+
datasets:
|
7 |
+
- flammenai/MahouMix-v1
|
8 |
+
---
|
9 |
+
![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)
|
10 |
+
|
11 |
+
# Mahou-1.3a-mistral-7B
|
12 |
+
|
13 |
+
Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.
|
14 |
+
|
15 |
+
### Chat Format
|
16 |
+
|
17 |
+
This model has been trained to use ChatML format. Note the additional tokens in [tokenizer_config.json](tokenizer_config.json).
|
18 |
+
|
19 |
+
```
|
20 |
+
<|im_start|>system
|
21 |
+
{{system}}<|im_end|>
|
22 |
+
<|im_start|>{{char}}
|
23 |
+
{{message}}<|im_end|>
|
24 |
+
<|im_start|>{{user}}
|
25 |
+
{{message}}<|im_end|>
|
26 |
+
```
|
27 |
+
|
28 |
+
### Roleplay Format
|
29 |
+
|
30 |
+
- Speech without quotes.
|
31 |
+
- Actions in `*asterisks*`
|
32 |
+
|
33 |
+
```
|
34 |
+
*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
|
35 |
+
```
|
36 |
+
|
37 |
+
### ST Settings
|
38 |
+
|
39 |
+
1. Use ChatML for the Context Template.
|
40 |
+
2. Enable Instruct Mode.
|
41 |
+
3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
|
42 |
+
4. Recommended: Add newline as a stopping string: `["\n"]`
|
43 |
+
|
44 |
+
### Method
|
45 |
+
|
46 |
+
DPO finetuned for 6 epochs using an A100 on Google Colab.
|
47 |
+
|
48 |
+
[Fine-tune a Mistral-7b model with Direct Preference Optimization](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac) - [Maxime Labonne](https://huggingface.co/mlabonne)
|