aaronday3's picture
Update README.md (#1)
169480b verified
metadata
base_model:
  - anthracite-org/magnum-12b-v2
  - nothingiisreal/MN-12B-Celeste-V1.9
library_name: transformers
tags:
  - mergekit
  - merge

Mistral Nemo 12B Starcannon v3

This is a merge of pre-trained language models created using mergekit.
Static GGUF (by Mradermacher)
Imatrix GGUF (by Mradermacher)
EXL2 (by kingbri of RoyalLab)

Merge Details

Merge Method

This model was merged using the TIES merge method using nothingiisreal/MN-12B-Celeste-V1.9 as a base.

Merge Fodder

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
    - model: anthracite-org/magnum-12b-v2
      parameters:
        density: 0.3
        weight: 0.5
    - model: nothingiisreal/MN-12B-Celeste-V1.9
      parameters:
        density: 0.7
        weight: 0.5

merge_method: ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
    normalize: true
    int8_mask: true
dtype: bfloat16