---

# Sloppy-Wingman-8x7B-hf

![Sloppy Wingman](https://files.catbox.moe/7ay3me.png)

Big slop, good model.

Runs better at a slightly higher temp (1.1-ish) than usual, along with 0.05 MinP and 0.28 snoot.
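
In preset form that looks roughly like the sketch below. Field names vary by frontend, and reading "snoot" as the smoothing-factor sampler is an assumption on my part:

```yaml
# Rough sampler preset mirroring the settings above (hypothetical field names).
# "snoot" is assumed here to mean the smoothing-factor sampler.
temperature: 1.1
min_p: 0.05
smoothing_factor: 0.28
```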

Bog-standard ChatML works best imo, but Alpaca and Mixtral formats work (to some degree) too.
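
For reference, bog-standard ChatML wraps each turn like so:

```
<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
Hello!<|im_end|>
<|im_start|>assistant
```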

Parts:
```yaml
models:
  - model: mistralai/Mixtral-8x7B-v0.1+retrieval-bar/Mixtral-8x7B-v0.1_case-briefs
    parameters:
      weight: 0.33
  - model: mistralai/Mixtral-8x7B-v0.1+wandb/Mixtral-8x7b-Remixtral
    parameters:
      weight: 0.33
merge_method: task_arithmetic
base_model: mistralai/Mixtral-8x7B-v0.1
dtype: float16
```

and

```yaml
models:
  - model: mistralai/Mixtral-8x7B-Instruct-v0.1+/ai/LLM/tmp/pefts/daybreak-peft/mixtral-8x7b
    parameters:
      weight: 0.85
  - model: notstoic/Nous-Hermes-2-Mixtruct-v0.1-8x7B-DPO-DARE_TIES
    parameters:
      weight: 0.25
  - model: ycros/BagelWorldTour-8x7B
    parameters:
      weight: 0.1
merge_method: task_arithmetic
base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
dtype: float16
```

SLERPed together as per the config below.
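
The `+` in the model references above is mergekit's syntax for applying a LoRA on top of a base model, and each part config can be baked with mergekit's CLI (e.g. `mergekit-yaml config.yml ./output-dir`) before the final SLERP.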

---

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details