emnakamura committed on
Commit
74f5a89
1 Parent(s): 4462537

Upload folder using huggingface_hub

Files changed (1)
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
+ ---
+ library_name: transformers
+ license: apache-2.0
+ base_model:
+ - Mahou-1.3-M1-mistral-7B
+ datasets:
+ - flammenai/MahouMix-v1
+ ---
+ ![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)
+
+ # Mahou-1.3a-mistral-7B
+
+ Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.
+
+ ### Chat Format
+
+ This model has been trained to use ChatML format. Note the additional tokens in [tokenizer_config.json](tokenizer_config.json).
+
+ ```
+ <|im_start|>system
+ {{system}}<|im_end|>
+ <|im_start|>{{char}}
+ {{message}}<|im_end|>
+ <|im_start|>{{user}}
+ {{message}}<|im_end|>
+ ```
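
The ChatML layout shown in this README can be assembled programmatically. The following is a minimal sketch, not part of the commit: the function name `build_chatml_prompt` and its parameters are hypothetical, and it assumes the speaker names are used directly as ChatML roles, as the `{{char}}` and `{{user}}` placeholders suggest.

```python
def build_chatml_prompt(system, turns, next_speaker):
    """Render a ChatML prompt string (illustrative helper, not from the model card).

    system:       text for the system block
    turns:        list of (speaker, message) pairs in conversation order
    next_speaker: role whose turn is left open for the model to complete
    """
    parts = [f"<|im_start|>system\n{system}<|im_end|>"]
    for speaker, message in turns:
        # Each turn is wrapped in the <|im_start|> / <|im_end|> tokens.
        parts.append(f"<|im_start|>{speaker}\n{message}<|im_end|>")
    # Leave the final turn open so generation continues as `next_speaker`.
    parts.append(f"<|im_start|>{next_speaker}\n")
    return "\n".join(parts)


if __name__ == "__main__":
    print(build_chatml_prompt(
        "You are Mahou, a cheerful apprentice mage.",
        [("Mahou", "*waves* hey, you made it!"), ("User", "What did you learn today?")],
        "Mahou",
    ))
```

If the tokenizer in the repository ships a chat template, using that template is likely the safer choice; this sketch only mirrors the plain-text layout above.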
+
+ ### Roleplay Format
+
+ - Speech without quotes.
+ - Actions in `*asterisks*`
+
+ ```
+ *leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
+ ```
+
+ ### ST Settings
+
+ 1. Use ChatML for the Context Template.
+ 2. Enable Instruct Mode.
+ 3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
+ 4. Recommended: Add newline as a stopping string: `["\n"]`
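
The effect of the recommended newline stopping string can be illustrated outside the frontend. This is a rough sketch under assumptions: `truncate_at_stop` is a hypothetical helper, not how any particular frontend implements stopping strings, and it simply cuts the generated text at the first occurrence of any stop string so that replies stay to a single short message.

```python
def truncate_at_stop(text, stop_strings=("\n",)):
    """Return `text` cut at the earliest stop string (illustrative only)."""
    cut = len(text)
    for stop in stop_strings:
        if not stop:
            continue  # ignore empty stop strings, which would match at index 0
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    raw = "*grins* sure, let's go!\nUser: wait, what about"
    print(truncate_at_stop(raw))
```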
+
+ ### Method
+
+ DPO finetuned for 6 epochs using an A100 on Google Colab.
+
+ [Fine-tune a Mistral-7b model with Direct Preference Optimization](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac) - [Maxime Labonne](https://huggingface.co/mlabonne)
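
For readers unfamiliar with DPO, the per-example objective can be sketched in a few lines. This is the standard DPO loss for one preference pair, not the training code used for this model: the README does not state `beta` or any other hyperparameter besides the epoch count, so `beta=0.1` below is an assumed placeholder, and the log-probabilities are made-up inputs.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for a single (chosen, rejected) pair.

    Inputs are summed token log-probabilities of each response under the
    policy being trained and under the frozen reference model.
    `beta` is an assumed value, not taken from the model card.
    """
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    x = -margin
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


if __name__ == "__main__":
    # When the policy matches the reference, the loss is log(2).
    print(dpo_loss(-1.0, -2.0, -1.0, -2.0))
```

The loss falls below log(2) as the policy raises the chosen response's likelihood relative to the rejected one, measured against the reference model.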