Aratako
/

LightChatAssistant-4x7B

Text Generation

Not-For-All-Audiences

nsfw

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

LightChatAssistant-4x7B / README.md

Aratako's picture

Update README.md

0e6f1b7 verified 7 months ago

|

history blame contribute delete

3.41 kB

	---
	license: apache-2.0
	language:
	- ja
	tags:
	- mistral
	- mixtral
	- not-for-all-audiences
	- nsfw
	base_model:
	- NTQAI/chatntq-ja-7b-v1.0
	- stabilityai/japanese-stablelm-instruct-gamma-7b
	- Elizezen/Antler-7B
	- Elizezen/Hameln-japanese-mistral-7B
	---

	# LightChatAssistant-4x7B
	[GGUF版はこちら/Click here for the GGUF version](https://huggingface.co/Aratako/LightChatAssistant-4x7B-GGUF)

	## 概要
	[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)と全く同じ手法を用いつつ、対象を4モデルに拡張してそれらをMoEしたモデルです。素晴らしい手法を提案いただいた[@Sdff-Ltba](https://huggingface.co/Sdff-Ltba)さんに感謝します。

	モデルサイズは大きくなっていますが、推論に同時に使われるエキスパートは2つなので、GPUにオフロードさえできれば元の[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)とほぼ同じような速度で動きます。


	以下の4モデルをChat Vectorの加算によって対話強化して素材に使用しています。
	- [NTQAI/chatntq-ja-7b-v1.0](https://huggingface.co/NTQAI/chatntq-ja-7b-v1.0)
	- [stabilityai/japanese-stablelm-instruct-gamma-7b](https://huggingface.co/stabilityai/japanese-stablelm-instruct-gamma-7b)
	- [Elizezen/Antler-7B](https://huggingface.co/Elizezen/Antler-7B)
	- [Elizezen/Hameln-japanese-mistral-7B](https://huggingface.co/Elizezen/Hameln-japanese-mistral-7B)

	Chat Vectorの適用方法は[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)と同じで、Mistral-7B-Instruct-v0.2とMistral-7B-v0.1の差分を0.8倍してそれぞれのモデルに足しています。

	その後、[mergekit](https://github.com/cg123/mergekit)を使い、4モデルをMoEしました。MoEのconfigは以下の通りです。

	```yaml
	base_model: ./chatntq-chatvector
	gate_mode: hidden
	dtype: bfloat16
	experts:
	- source_model: ./chatntq-chatvector
	positive_prompts:
	- "roleplay"
	- "question"
	- "answer"
	- "chat"
	- "companion"
	- "character"
	- "math"
	- "code"
	- "[Mode: Roleplay]"
	- "[Mode: Chat]"
	negative_prompts:
	- "storywriting"
	- "book"
	- "story"
	- "chapter"
	- "tale"
	- "history"
	- "write"
	- "novel"
	- "[Mode: Writing]"
	- source_model: ./japanese-stablelm-instruct-gamma-chatvector
	positive_prompts:
	- "roleplay"
	- "question"
	- "answer"
	- "chat"
	- "companion"
	- "character"
	- "math"
	- "code"
	- "[Mode: Roleplay]"
	- "[Mode: Chat]"
	negative_prompts:
	- "storywriting"
	- "book"
	- "story"
	- "chapter"
	- "tale"
	- "history"
	- "write"
	- "novel"
	- "[Mode: Writing]"
	- source_model: ./Hameln-japanese-mistral-chatvector
	positive_prompts:
	- "sex"
	- "storywriting"
	- "erotic"
	- "fuck"
	- "orgasm"
	- "uncensored"
	- "book"
	- "story"
	- "chapter"
	- "tale"
	- "history"
	- "write"
	- "novel"
	- "[Mode: Writing]"
	- source_model: ./Antler-chatvector
	positive_prompts:
	- "sex"
	- "storywriting"
	- "erotic"
	- "fuck"
	- "orgasm"
	- "uncensored"
	- "book"
	- "story"
	- "chapter"
	- "tale"
	- "history"
	- "write"
	- "novel"
	- "[Mode: Writing]"
	tokenizer_source: union
	```