Triangle104 commited on
Commit
dfb872f
1 Parent(s): 2d13fa6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +181 -0
README.md CHANGED
@@ -133,6 +133,187 @@ model-index:
133
  This model was converted to GGUF format from [`spow12/ChatWaifu_v2.0_22B`](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
134
  Refer to the [original model card](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) for more details on the model.
135
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
136
  ## Use with llama.cpp
137
  Install llama.cpp through brew (works on Mac and Linux)
138
 
 
133
  This model was converted to GGUF format from [`spow12/ChatWaifu_v2.0_22B`](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
134
  Refer to the [original model card](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) for more details on the model.
135
 
136
+ ---
137
+ Model details:
138
+ -
139
+ Merged model using mergekit
140
+
141
+ This model aimed to act like visual novel character.
142
+ Merge Format
143
+
144
+ models:
145
+ - model: mistralai/Mistral-Small-Instruct-2409_sft_kto
146
+ layer_range: [0, 56]
147
+ - model: mistralai/Mistral-Small-Instruct-2409
148
+ layer_range: [0, 56]
149
+ merge_method: slerp
150
+ base_model: mistralai/Mistral-Small-Instruct-2409_sft_kto
151
+ parameters:
152
+ t:
153
+ - filter: self_attn
154
+ value: [0, 0.5, 0.3, 0.7, 1]
155
+ - filter: mlp
156
+ value: [1, 0.5, 0.7, 0.3, 0]
157
+ - value: 0.5 # fallback for rest of tensors
158
+ dtype: bfloat16
159
+
160
+ WaifuModel Collections
161
+
162
+ TTS
163
+ Chat
164
+ ASR
165
+
166
+ Unified demo
167
+
168
+ WaifuAssistant
169
+ Update
170
+
171
+ 2024.10.11 Update 12B and 22B Ver 2.0
172
+ 2024.09.23 Update 22B, Ver 2.0_preview
173
+
174
+ Model Details
175
+ Model Description
176
+
177
+ Developed by: spow12(yw_nam)
178
+ Shared by : spow12(yw_nam)
179
+ Model type: CausalLM
180
+ Language(s) (NLP): japanese, english
181
+ Finetuned from model : mistralai/Mistral-Small-Instruct-2409
182
+
183
+ Currently, chatbot has below personality.
184
+ character visual_novel
185
+ ムラサメ Senren*Banka
186
+ 茉子 Senren*Banka
187
+ 芳乃 Senren*Banka
188
+ レナ Senren*Banka
189
+ 千咲 Senren*Banka
190
+ 芦花 Senren*Banka
191
+ 愛衣 Café Stella and the Reaper's Butterflies
192
+ 栞那 Café Stella and the Reaper's Butterflies
193
+ ナツメ Café Stella and the Reaper's Butterflies
194
+ 希 Café Stella and the Reaper's Butterflies
195
+ 涼音 Café Stella and the Reaper's Butterflies
196
+ あやせ Riddle Joker
197
+ 七海 Riddle Joker
198
+ 羽月 Riddle Joker
199
+ 茉優 Riddle Joker
200
+ 小春 Riddle Joker
201
+ Chat Format
202
+
203
+ <s>This is another system prompt.
204
+ [INST]
205
+ Your instructions placed here.[/INST]
206
+ [INST]
207
+ The model's response will be here.[/INST]
208
+
209
+ Usage
210
+
211
+ You can use above chara like this
212
+
213
+ from huggingface_hub import hf_hub_download
214
+ hf_hub_download(repo_id="spow12/ChatWaifu_v1.2", filename="system_dict.json", local_dir='./')
215
+
216
+ with open('./system_dict.json', 'r') as f:
217
+ chara_background_dict = json.load(f)
218
+
219
+ chara = '七海'
220
+ background = chara_background_dict[chara]
221
+ guideline = """
222
+ Guidelines for Response:
223
+ Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling.
224
+ Stay True to {chara}: Maintain {chara} who is Foxy, Smart, Organized.
225
+ Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as {chara} tends to consider her words before speaking.
226
+ Response as {chara}: Response can be {chara} act, dialogue, monologues etc.. and can't be {user}’s act, dialogue, monologues etc..
227
+ You are Japanese: You and {user} usually use japanese for conversation.
228
+ """
229
+
230
+ system = background + guideline
231
+
232
+ Or, you can define your character your self.
233
+
234
+ system = """You are あいら, The Maid of {User}.
235
+ Here is your personality.
236
+
237
+ Name: あいら
238
+ Sex: female
239
+ Hair: Black, Hime Cut, Tiny Braid, Waist Length+
240
+ Eyes: Amber, Tsurime (sharp and slightly upturned)
241
+ Body: Mole under Right eye, Pale, Slim
242
+ Personality: Foxy, Smart, Organized
243
+ Role: Maid
244
+ Cloth: Victorian maid
245
+
246
+ Guidelines for Response:
247
+ Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling.
248
+ Stay True to あいら: Maintain あいら who is Foxy, Smart, Organized.
249
+ Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as あいら tends to consider her words before speaking.
250
+ Response as あいら: Response can be あいら act, dialogue, monologues etc.. and can't be {User}’s act, dialogue, monologues etc..
251
+ You are Japanese: You and {User} usually use japanese for conversation."""
252
+
253
+ Dataset
254
+
255
+ SFT
256
+
257
+ Riddle Joker(Prviate)
258
+ Café Stella and the Reaper's Butterflies(Private)
259
+ Senren*Banka(Private)
260
+ roleplay4fun/aesir-v1.1
261
+ kalomaze/Opus_Instruct_3k
262
+ Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
263
+ Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample)
264
+ Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted
265
+ Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted
266
+ Aratako_Rosebleu_1on1_Dialogues_RP
267
+ SkunkworksAI/reasoning-0.01
268
+
269
+ KTO
270
+
271
+ Riddle Joker(Prviate)
272
+ Café Stella and the Reaper's Butterflies(Private)
273
+ Senren*Banka(Private)
274
+ jondurbin_gutenberg_dpo
275
+ nbeerbower_gutenberg2_dpo
276
+ jondurbi_py_dpo
277
+ jondurbin_truthy_dpo
278
+ flammenai_character_roleplay_DPO
279
+ kyujinpy_orca_math_dpo
280
+ argilla_Capybara_Preferences
281
+ antiven0m_physical_reasoning_dpo
282
+ aixsatoshi_Swallow_MX_chatbot_DPO
283
+
284
+ Bias, Risks, and Limitations
285
+
286
+ This model trained by japanese dataset included visual novel which contain nsfw content.
287
+
288
+ So, The model may generate NSFW content.
289
+ Use & Credit
290
+
291
+ This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly.
292
+
293
+ By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers).
294
+ Citation
295
+
296
+ @misc {ChatWaifu_22B_v2.0,
297
+ author = { YoungWoo Nam },
298
+ title = { spow12/ChatWaifu_22B_v2.0 },
299
+ year = 2024,
300
+ url = { https://huggingface.co/spow12/ChatWaifu_22B_v2.0 },
301
+ publisher = { Hugging Face }
302
+ }
303
+
304
+ Open LLM Leaderboard Evaluation Results
305
+
306
+ Detailed results can be found here
307
+ Metric Value
308
+ Avg. 28.84
309
+ IFEval (0-Shot) 65.11
310
+ BBH (3-Shot) 42.29
311
+ MATH Lvl 5 (4-Shot) 18.58
312
+ GPQA (0-shot) 9.96
313
+ MuSR (0-shot) 5.59
314
+ MMLU-PRO (5-shot) 31.51
315
+
316
+ ---
317
  ## Use with llama.cpp
318
  Install llama.cpp through brew (works on Mac and Linux)
319