Edit model card

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

This model is a merge of all of my SOVL models, in the hopes to create the most unhinged and wild model possible. But in Mixtral fashion!

It may be insane, it may be incoherent. I can't load it :3

Merge Method

This model was merged using the Mixture Of Experts method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model: saishf/Ortho-SOVL-8B-L3
gate_mode: random
dtype: bfloat16
experts:
  - source_model: saishf/Ortho-SOVL-8B-L3
  - source_model: saishf/SOVLish-Maid-L3-8B
  - source_model: saishf/Merge-Mayhem-L3-V2.1
  - source_model: saishf/Merge-Mayhem-L3-V2

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 66.76
AI2 Reasoning Challenge (25-Shot) 61.95
HellaSwag (10-Shot) 79.38
MMLU (5-Shot) 65.49
TruthfulQA (0-shot) 51.48
Winogrande (5-shot) 75.69
GSM8k (5-shot) 66.57
Downloads last month
15
Safetensors
Model size
24.9B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for saishf/Llama4Some-SOVL-4x8B-L3-V1

Evaluation results