Edit model card

Image description

Tsunemoto GGUF's of FlatDolphinMaid-8x7B

This is a GGUF quantization of FlatDolphinMaid-8x7B.

Original Repo Link:

Original Repository

Original Model Card:


First experimental merge of Noromaid 8x7b (Instruct) and dolphin 8x7b. The idea behind this is to add a little more IQ to the model, because Noromaid was only trained on RP/ERP data. Dolphin 2.7 is the only real Mixtral finetune I consider "usable", and so the merging quest begin again kek.

Merged Dolphin 2.7 with Mixtral Base (Dolphin was at 1.0 weight) to get rid of ChatLM, and then I merged Noromaid 8x7b with the output, SLERP method.

This model feel better on the IQ chart and have the ~same average ERP score on ayumi bench' than Noromaid 8x7b, but it's softer and more prude too, it also have the typical Mixtral repeat issue at some point. Choose your poison.

image/png

Description

This repo contains fp16 files of FlatDolphinMaid-8x7B.

Models used

Custom format:

### Instruction:
{system prompt}

### Input:
{input}

### Response:
{reply}

If you want to support me, you can here.

Downloads last month
58
GGUF
Model size
46.7B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model's library. Check the docs .