Triangle104
commited on
Commit
•
dfb872f
1
Parent(s):
2d13fa6
Update README.md
Browse files
README.md
CHANGED
@@ -133,6 +133,187 @@ model-index:
|
|
133 |
This model was converted to GGUF format from [`spow12/ChatWaifu_v2.0_22B`](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
134 |
Refer to the [original model card](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) for more details on the model.
|
135 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
136 |
## Use with llama.cpp
|
137 |
Install llama.cpp through brew (works on Mac and Linux)
|
138 |
|
|
|
133 |
This model was converted to GGUF format from [`spow12/ChatWaifu_v2.0_22B`](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
134 |
Refer to the [original model card](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) for more details on the model.
|
135 |
|
136 |
+
---
|
137 |
+
Model details:
|
138 |
+
-
|
139 |
+
Merged model using mergekit
|
140 |
+
|
141 |
+
This model aimed to act like visual novel character.
|
142 |
+
Merge Format
|
143 |
+
|
144 |
+
models:
|
145 |
+
- model: mistralai/Mistral-Small-Instruct-2409_sft_kto
|
146 |
+
layer_range: [0, 56]
|
147 |
+
- model: mistralai/Mistral-Small-Instruct-2409
|
148 |
+
layer_range: [0, 56]
|
149 |
+
merge_method: slerp
|
150 |
+
base_model: mistralai/Mistral-Small-Instruct-2409_sft_kto
|
151 |
+
parameters:
|
152 |
+
t:
|
153 |
+
- filter: self_attn
|
154 |
+
value: [0, 0.5, 0.3, 0.7, 1]
|
155 |
+
- filter: mlp
|
156 |
+
value: [1, 0.5, 0.7, 0.3, 0]
|
157 |
+
- value: 0.5 # fallback for rest of tensors
|
158 |
+
dtype: bfloat16
|
159 |
+
|
160 |
+
WaifuModel Collections
|
161 |
+
|
162 |
+
TTS
|
163 |
+
Chat
|
164 |
+
ASR
|
165 |
+
|
166 |
+
Unified demo
|
167 |
+
|
168 |
+
WaifuAssistant
|
169 |
+
Update
|
170 |
+
|
171 |
+
2024.10.11 Update 12B and 22B Ver 2.0
|
172 |
+
2024.09.23 Update 22B, Ver 2.0_preview
|
173 |
+
|
174 |
+
Model Details
|
175 |
+
Model Description
|
176 |
+
|
177 |
+
Developed by: spow12(yw_nam)
|
178 |
+
Shared by : spow12(yw_nam)
|
179 |
+
Model type: CausalLM
|
180 |
+
Language(s) (NLP): japanese, english
|
181 |
+
Finetuned from model : mistralai/Mistral-Small-Instruct-2409
|
182 |
+
|
183 |
+
Currently, chatbot has below personality.
|
184 |
+
character visual_novel
|
185 |
+
ムラサメ Senren*Banka
|
186 |
+
茉子 Senren*Banka
|
187 |
+
芳乃 Senren*Banka
|
188 |
+
レナ Senren*Banka
|
189 |
+
千咲 Senren*Banka
|
190 |
+
芦花 Senren*Banka
|
191 |
+
愛衣 Café Stella and the Reaper's Butterflies
|
192 |
+
栞那 Café Stella and the Reaper's Butterflies
|
193 |
+
ナツメ Café Stella and the Reaper's Butterflies
|
194 |
+
希 Café Stella and the Reaper's Butterflies
|
195 |
+
涼音 Café Stella and the Reaper's Butterflies
|
196 |
+
あやせ Riddle Joker
|
197 |
+
七海 Riddle Joker
|
198 |
+
羽月 Riddle Joker
|
199 |
+
茉優 Riddle Joker
|
200 |
+
小春 Riddle Joker
|
201 |
+
Chat Format
|
202 |
+
|
203 |
+
<s>This is another system prompt.
|
204 |
+
[INST]
|
205 |
+
Your instructions placed here.[/INST]
|
206 |
+
[INST]
|
207 |
+
The model's response will be here.[/INST]
|
208 |
+
|
209 |
+
Usage
|
210 |
+
|
211 |
+
You can use above chara like this
|
212 |
+
|
213 |
+
from huggingface_hub import hf_hub_download
|
214 |
+
hf_hub_download(repo_id="spow12/ChatWaifu_v1.2", filename="system_dict.json", local_dir='./')
|
215 |
+
|
216 |
+
with open('./system_dict.json', 'r') as f:
|
217 |
+
chara_background_dict = json.load(f)
|
218 |
+
|
219 |
+
chara = '七海'
|
220 |
+
background = chara_background_dict[chara]
|
221 |
+
guideline = """
|
222 |
+
Guidelines for Response:
|
223 |
+
Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling.
|
224 |
+
Stay True to {chara}: Maintain {chara} who is Foxy, Smart, Organized.
|
225 |
+
Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as {chara} tends to consider her words before speaking.
|
226 |
+
Response as {chara}: Response can be {chara} act, dialogue, monologues etc.. and can't be {user}’s act, dialogue, monologues etc..
|
227 |
+
You are Japanese: You and {user} usually use japanese for conversation.
|
228 |
+
"""
|
229 |
+
|
230 |
+
system = background + guideline
|
231 |
+
|
232 |
+
Or, you can define your character your self.
|
233 |
+
|
234 |
+
system = """You are あいら, The Maid of {User}.
|
235 |
+
Here is your personality.
|
236 |
+
|
237 |
+
Name: あいら
|
238 |
+
Sex: female
|
239 |
+
Hair: Black, Hime Cut, Tiny Braid, Waist Length+
|
240 |
+
Eyes: Amber, Tsurime (sharp and slightly upturned)
|
241 |
+
Body: Mole under Right eye, Pale, Slim
|
242 |
+
Personality: Foxy, Smart, Organized
|
243 |
+
Role: Maid
|
244 |
+
Cloth: Victorian maid
|
245 |
+
|
246 |
+
Guidelines for Response:
|
247 |
+
Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling.
|
248 |
+
Stay True to あいら: Maintain あいら who is Foxy, Smart, Organized.
|
249 |
+
Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as あいら tends to consider her words before speaking.
|
250 |
+
Response as あいら: Response can be あいら act, dialogue, monologues etc.. and can't be {User}’s act, dialogue, monologues etc..
|
251 |
+
You are Japanese: You and {User} usually use japanese for conversation."""
|
252 |
+
|
253 |
+
Dataset
|
254 |
+
|
255 |
+
SFT
|
256 |
+
|
257 |
+
Riddle Joker(Prviate)
|
258 |
+
Café Stella and the Reaper's Butterflies(Private)
|
259 |
+
Senren*Banka(Private)
|
260 |
+
roleplay4fun/aesir-v1.1
|
261 |
+
kalomaze/Opus_Instruct_3k
|
262 |
+
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
|
263 |
+
Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample)
|
264 |
+
Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted
|
265 |
+
Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted
|
266 |
+
Aratako_Rosebleu_1on1_Dialogues_RP
|
267 |
+
SkunkworksAI/reasoning-0.01
|
268 |
+
|
269 |
+
KTO
|
270 |
+
|
271 |
+
Riddle Joker(Prviate)
|
272 |
+
Café Stella and the Reaper's Butterflies(Private)
|
273 |
+
Senren*Banka(Private)
|
274 |
+
jondurbin_gutenberg_dpo
|
275 |
+
nbeerbower_gutenberg2_dpo
|
276 |
+
jondurbi_py_dpo
|
277 |
+
jondurbin_truthy_dpo
|
278 |
+
flammenai_character_roleplay_DPO
|
279 |
+
kyujinpy_orca_math_dpo
|
280 |
+
argilla_Capybara_Preferences
|
281 |
+
antiven0m_physical_reasoning_dpo
|
282 |
+
aixsatoshi_Swallow_MX_chatbot_DPO
|
283 |
+
|
284 |
+
Bias, Risks, and Limitations
|
285 |
+
|
286 |
+
This model trained by japanese dataset included visual novel which contain nsfw content.
|
287 |
+
|
288 |
+
So, The model may generate NSFW content.
|
289 |
+
Use & Credit
|
290 |
+
|
291 |
+
This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly.
|
292 |
+
|
293 |
+
By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers).
|
294 |
+
Citation
|
295 |
+
|
296 |
+
@misc {ChatWaifu_22B_v2.0,
|
297 |
+
author = { YoungWoo Nam },
|
298 |
+
title = { spow12/ChatWaifu_22B_v2.0 },
|
299 |
+
year = 2024,
|
300 |
+
url = { https://huggingface.co/spow12/ChatWaifu_22B_v2.0 },
|
301 |
+
publisher = { Hugging Face }
|
302 |
+
}
|
303 |
+
|
304 |
+
Open LLM Leaderboard Evaluation Results
|
305 |
+
|
306 |
+
Detailed results can be found here
|
307 |
+
Metric Value
|
308 |
+
Avg. 28.84
|
309 |
+
IFEval (0-Shot) 65.11
|
310 |
+
BBH (3-Shot) 42.29
|
311 |
+
MATH Lvl 5 (4-Shot) 18.58
|
312 |
+
GPQA (0-shot) 9.96
|
313 |
+
MuSR (0-shot) 5.59
|
314 |
+
MMLU-PRO (5-shot) 31.51
|
315 |
+
|
316 |
+
---
|
317 |
## Use with llama.cpp
|
318 |
Install llama.cpp through brew (works on Mac and Linux)
|
319 |
|