flammenai
/

Mahou-1.3a-mistral-7B-GGUF

emnakamura commited on Jun 2

Commit

74f5a89

•

1 Parent(s): 4462537

Upload folder using huggingface_hub

Files changed (1) hide show

README.md ADDED Viewed

+---
+library_name: transformers
+license: apache-2.0
+base_model:
+- Mahou-1.3-M1-mistral-7B
+datasets:
+- flammenai/MahouMix-v1
+---
+![image/png](https://huggingface.co/flammenai/Mahou-1.0-mistral-7B/resolve/main/mahou1.png)
+# Mahou-1.3a-mistral-7B
+Mahou is designed to provide short messages in a conversational context. It is capable of casual conversation and character roleplay.
+### Chat Format
+This model has been trained to use ChatML format. Note the additional tokens in [tokenizer_config.json](tokenizer_config.json).
+```
+<|im_start|>system
+{{system}}<|im_end|>
+<|im_start|>{{char}}
+{{message}}<|im_end|>
+<|im_start|>{{user}}
+{{message}}<|im_end|>
+```
+### Roleplay Format
+- Speech without quotes.
+- Actions in `*asterisks*`
+```
+*leans against wall cooly* so like, i just casted a super strong spell at magician academy today, not gonna lie, felt badass.
+```
+### ST Settings
+1. Use ChatML for the Context Template.
+2. Enable Instruct Mode.
+3. Use the [Mahou preset](https://huggingface.co/datasets/flammenai/Mahou-ST-ChatML-Instruct/raw/main/Mahou.json).
+4. Recommended: Add newline as a stopping string: `["\n"]`
+### Method
+DPO finetuned for 6 epochs using an A100 on Google Colab.
+[Fine-tune a Mistral-7b model with Direct Preference Optimization](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac) - [Maxime Labonne](https://huggingface.co/mlabonne)