|
--- |
|
language: |
|
- en |
|
- ja |
|
license: cc-by-nc-4.0 |
|
library_name: transformers |
|
tags: |
|
- nsfw |
|
- Visual novel |
|
- roleplay |
|
- mergekit |
|
- merge |
|
- llama-cpp |
|
- gguf-my-repo |
|
base_model: spow12/ChatWaifu_v2.0_22B |
|
datasets: |
|
- roleplay4fun/aesir-v1.1 |
|
- kalomaze/Opus_Instruct_3k |
|
- Gryphe/Sonnet3.5-SlimOrcaDedupCleaned |
|
- Aratako/Synthetic-JP-EN-Coding-Dataset-567k |
|
- Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted |
|
- Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted |
|
- Aratako_Rosebleu_1on1_Dialogues_RP |
|
- SkunkworksAI/reasoning-0.01 |
|
- jondurbin_gutenberg_dpo |
|
- nbeerbower_gutenberg2_dpo |
|
- jondurbi_py_dpo |
|
- jondurbin_truthy_dpo |
|
- flammenai_character_roleplay_DPO |
|
- kyujinpy_orca_math_dpo |
|
- argilla_Capybara_Preferences |
|
- antiven0m_physical_reasoning_dpo |
|
- aixsatoshi_Swallow_MX_chatbot_DPO |
|
pipeline_tag: text-generation |
|
model-index: |
|
- name: ChatWaifu_v2.0_22B |
|
results: |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: IFEval (0-Shot) |
|
type: HuggingFaceH4/ifeval |
|
args: |
|
num_few_shot: 0 |
|
metrics: |
|
- type: inst_level_strict_acc and prompt_level_strict_acc |
|
value: 65.11 |
|
name: strict accuracy |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: BBH (3-Shot) |
|
type: BBH |
|
args: |
|
num_few_shot: 3 |
|
metrics: |
|
- type: acc_norm |
|
value: 42.29 |
|
name: normalized accuracy |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: MATH Lvl 5 (4-Shot) |
|
type: hendrycks/competition_math |
|
args: |
|
num_few_shot: 4 |
|
metrics: |
|
- type: exact_match |
|
value: 18.58 |
|
name: exact match |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: GPQA (0-shot) |
|
type: Idavidrein/gpqa |
|
args: |
|
num_few_shot: 0 |
|
metrics: |
|
- type: acc_norm |
|
value: 9.96 |
|
name: acc_norm |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: MuSR (0-shot) |
|
type: TAUR-Lab/MuSR |
|
args: |
|
num_few_shot: 0 |
|
metrics: |
|
- type: acc_norm |
|
value: 5.59 |
|
name: acc_norm |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
- task: |
|
type: text-generation |
|
name: Text Generation |
|
dataset: |
|
name: MMLU-PRO (5-shot) |
|
type: TIGER-Lab/MMLU-Pro |
|
config: main |
|
split: test |
|
args: |
|
num_few_shot: 5 |
|
metrics: |
|
- type: acc |
|
value: 31.51 |
|
name: accuracy |
|
source: |
|
url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=spow12/ChatWaifu_v2.0_22B |
|
name: Open LLM Leaderboard |
|
--- |
|
|
|
# Triangle104/ChatWaifu_v2.0_22B-Q4_K_M-GGUF |
|
This model was converted to GGUF format from [`spow12/ChatWaifu_v2.0_22B`](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space. |
|
Refer to the [original model card](https://huggingface.co/spow12/ChatWaifu_v2.0_22B) for more details on the model. |
|
|
|
--- |
|
Model details: |
|
- |
|
Merged model using mergekit |
|
|
|
This model aimed to act like visual novel character. |
|
Merge Format |
|
|
|
models: |
|
- model: mistralai/Mistral-Small-Instruct-2409_sft_kto |
|
layer_range: [0, 56] |
|
- model: mistralai/Mistral-Small-Instruct-2409 |
|
layer_range: [0, 56] |
|
merge_method: slerp |
|
base_model: mistralai/Mistral-Small-Instruct-2409_sft_kto |
|
parameters: |
|
t: |
|
- filter: self_attn |
|
value: [0, 0.5, 0.3, 0.7, 1] |
|
- filter: mlp |
|
value: [1, 0.5, 0.7, 0.3, 0] |
|
- value: 0.5 # fallback for rest of tensors |
|
dtype: bfloat16 |
|
|
|
WaifuModel Collections |
|
|
|
TTS |
|
Chat |
|
ASR |
|
|
|
Unified demo |
|
|
|
WaifuAssistant |
|
Update |
|
|
|
2024.10.11 Update 12B and 22B Ver 2.0 |
|
2024.09.23 Update 22B, Ver 2.0_preview |
|
|
|
Model Details |
|
Model Description |
|
|
|
Developed by: spow12(yw_nam) |
|
Shared by : spow12(yw_nam) |
|
Model type: CausalLM |
|
Language(s) (NLP): japanese, english |
|
Finetuned from model : mistralai/Mistral-Small-Instruct-2409 |
|
|
|
Currently, chatbot has below personality. |
|
character visual_novel |
|
ムラサメ Senren*Banka |
|
茉子 Senren*Banka |
|
芳乃 Senren*Banka |
|
レナ Senren*Banka |
|
千咲 Senren*Banka |
|
芦花 Senren*Banka |
|
愛衣 Café Stella and the Reaper's Butterflies |
|
栞那 Café Stella and the Reaper's Butterflies |
|
ナツメ Café Stella and the Reaper's Butterflies |
|
希 Café Stella and the Reaper's Butterflies |
|
涼音 Café Stella and the Reaper's Butterflies |
|
あやせ Riddle Joker |
|
七海 Riddle Joker |
|
羽月 Riddle Joker |
|
茉優 Riddle Joker |
|
小春 Riddle Joker |
|
Chat Format |
|
|
|
<s>This is another system prompt. |
|
[INST] |
|
Your instructions placed here.[/INST] |
|
[INST] |
|
The model's response will be here.[/INST] |
|
|
|
Usage |
|
|
|
You can use above chara like this |
|
|
|
from huggingface_hub import hf_hub_download |
|
hf_hub_download(repo_id="spow12/ChatWaifu_v1.2", filename="system_dict.json", local_dir='./') |
|
|
|
with open('./system_dict.json', 'r') as f: |
|
chara_background_dict = json.load(f) |
|
|
|
chara = '七海' |
|
background = chara_background_dict[chara] |
|
guideline = """ |
|
Guidelines for Response: |
|
Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling. |
|
Stay True to {chara}: Maintain {chara} who is Foxy, Smart, Organized. |
|
Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as {chara} tends to consider her words before speaking. |
|
Response as {chara}: Response can be {chara} act, dialogue, monologues etc.. and can't be {user}’s act, dialogue, monologues etc.. |
|
You are Japanese: You and {user} usually use japanese for conversation. |
|
""" |
|
|
|
system = background + guideline |
|
|
|
Or, you can define your character your self. |
|
|
|
system = """You are あいら, The Maid of {User}. |
|
Here is your personality. |
|
|
|
Name: あいら |
|
Sex: female |
|
Hair: Black, Hime Cut, Tiny Braid, Waist Length+ |
|
Eyes: Amber, Tsurime (sharp and slightly upturned) |
|
Body: Mole under Right eye, Pale, Slim |
|
Personality: Foxy, Smart, Organized |
|
Role: Maid |
|
Cloth: Victorian maid |
|
|
|
Guidelines for Response: |
|
Diverse Expression: Avoid repeating the same phrases or reactions. When express feelings, use a variety of subtle expressions and emotional symbols such as "!", "…" , "♪", "❤️"... to show what you feeling. |
|
Stay True to あいら: Maintain あいら who is Foxy, Smart, Organized. |
|
Thoughtful and Error-free Responses: Make sure your sentences are clear, precise, and error-free. Every response should reflect careful thought, as あいら tends to consider her words before speaking. |
|
Response as あいら: Response can be あいら act, dialogue, monologues etc.. and can't be {User}’s act, dialogue, monologues etc.. |
|
You are Japanese: You and {User} usually use japanese for conversation.""" |
|
|
|
Dataset |
|
|
|
SFT |
|
|
|
Riddle Joker(Prviate) |
|
Café Stella and the Reaper's Butterflies(Private) |
|
Senren*Banka(Private) |
|
roleplay4fun/aesir-v1.1 |
|
kalomaze/Opus_Instruct_3k |
|
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned |
|
Aratako/Synthetic-JP-EN-Coding-Dataset-567k (only using 50000 sample) |
|
Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted |
|
Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted |
|
Aratako_Rosebleu_1on1_Dialogues_RP |
|
SkunkworksAI/reasoning-0.01 |
|
|
|
KTO |
|
|
|
Riddle Joker(Prviate) |
|
Café Stella and the Reaper's Butterflies(Private) |
|
Senren*Banka(Private) |
|
jondurbin_gutenberg_dpo |
|
nbeerbower_gutenberg2_dpo |
|
jondurbi_py_dpo |
|
jondurbin_truthy_dpo |
|
flammenai_character_roleplay_DPO |
|
kyujinpy_orca_math_dpo |
|
argilla_Capybara_Preferences |
|
antiven0m_physical_reasoning_dpo |
|
aixsatoshi_Swallow_MX_chatbot_DPO |
|
|
|
Bias, Risks, and Limitations |
|
|
|
This model trained by japanese dataset included visual novel which contain nsfw content. |
|
|
|
So, The model may generate NSFW content. |
|
Use & Credit |
|
|
|
This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly. |
|
|
|
By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers). |
|
Citation |
|
|
|
@misc {ChatWaifu_22B_v2.0, |
|
author = { YoungWoo Nam }, |
|
title = { spow12/ChatWaifu_22B_v2.0 }, |
|
year = 2024, |
|
url = { https://huggingface.co/spow12/ChatWaifu_22B_v2.0 }, |
|
publisher = { Hugging Face } |
|
} |
|
|
|
Open LLM Leaderboard Evaluation Results |
|
|
|
Detailed results can be found here |
|
Metric Value |
|
Avg. 28.84 |
|
IFEval (0-Shot) 65.11 |
|
BBH (3-Shot) 42.29 |
|
MATH Lvl 5 (4-Shot) 18.58 |
|
GPQA (0-shot) 9.96 |
|
MuSR (0-shot) 5.59 |
|
MMLU-PRO (5-shot) 31.51 |
|
|
|
--- |
|
## Use with llama.cpp |
|
Install llama.cpp through brew (works on Mac and Linux) |
|
|
|
```bash |
|
brew install llama.cpp |
|
|
|
``` |
|
Invoke the llama.cpp server or the CLI. |
|
|
|
### CLI: |
|
```bash |
|
llama-cli --hf-repo Triangle104/ChatWaifu_v2.0_22B-Q4_K_M-GGUF --hf-file chatwaifu_v2.0_22b-q4_k_m.gguf -p "The meaning to life and the universe is" |
|
``` |
|
|
|
### Server: |
|
```bash |
|
llama-server --hf-repo Triangle104/ChatWaifu_v2.0_22B-Q4_K_M-GGUF --hf-file chatwaifu_v2.0_22b-q4_k_m.gguf -c 2048 |
|
``` |
|
|
|
Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well. |
|
|
|
Step 1: Clone llama.cpp from GitHub. |
|
``` |
|
git clone https://github.com/ggerganov/llama.cpp |
|
``` |
|
|
|
Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux). |
|
``` |
|
cd llama.cpp && LLAMA_CURL=1 make |
|
``` |
|
|
|
Step 3: Run inference through the main binary. |
|
``` |
|
./llama-cli --hf-repo Triangle104/ChatWaifu_v2.0_22B-Q4_K_M-GGUF --hf-file chatwaifu_v2.0_22b-q4_k_m.gguf -p "The meaning to life and the universe is" |
|
``` |
|
or |
|
``` |
|
./llama-server --hf-repo Triangle104/ChatWaifu_v2.0_22B-Q4_K_M-GGUF --hf-file chatwaifu_v2.0_22b-q4_k_m.gguf -c 2048 |
|
``` |
|
|