Edit model card

image/png

Temitoku x Mitekuto

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: passthrough
slices:
- sources:
  - layer_range: [0, 19]
    model: unsloth/Mistral-Small-Instruct-2409
# Original L19
- sources:
  - layer_range: [19, 20]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L19
- sources:
  - layer_range: [19, 20]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L19
- sources:
  - layer_range: [19, 20]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L20
- sources:
  - layer_range: [20, 21]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L20
- sources:
  - layer_range: [20, 21]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L20
- sources:
  - layer_range: [20, 21]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L21
- sources:
  - layer_range: [21, 22]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L21
- sources:
  - layer_range: [21, 22]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L21
- sources:
  - layer_range: [21, 22]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L22
- sources:
  - layer_range: [22, 23]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L22
- sources:
  - layer_range: [22, 23]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L22
- sources:
  - layer_range: [22, 23]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L23
- sources:
  - layer_range: [23, 24]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L23
- sources:
  - layer_range: [23, 24]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L23
- sources:
  - layer_range: [23, 24]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L24
- sources:
  - layer_range: [24, 25]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L24
- sources:
  - layer_range: [24, 25]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L24
- sources:
  - layer_range: [24, 25]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L25
- sources:
  - layer_range: [25, 26]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L25
- sources:
  - layer_range: [25, 26]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L25
- sources:
  - layer_range: [25, 26]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L26
- sources:
  - layer_range: [26, 27]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L26
- sources:
  - layer_range: [26, 27]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L26
- sources:
  - layer_range: [26, 27]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L27
- sources:
  - layer_range: [27, 28]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L27
- sources:
  - layer_range: [27, 28]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L27
- sources:
  - layer_range: [27, 28]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L28
- sources:
  - layer_range: [28, 29]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L28
- sources:
  - layer_range: [28, 29]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L28
- sources:
  - layer_range: [28, 29]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L29
- sources:
  - layer_range: [29, 30]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L29
- sources:
  - layer_range: [29, 30]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L29
- sources:
  - layer_range: [29, 30]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L30
- sources:
  - layer_range: [30, 31]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L30
- sources:
  - layer_range: [30, 31]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L30
- sources:
  - layer_range: [30, 31]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L31
- sources:
  - layer_range: [31, 32]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L31
- sources:
  - layer_range: [31, 32]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L31
- sources:
  - layer_range: [31, 32]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L32
- sources:
  - layer_range: [32, 33]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L32
- sources:
  - layer_range: [32, 33]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L32
- sources:
  - layer_range: [32, 33]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L33
- sources:
  - layer_range: [33, 34]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L33
- sources:
  - layer_range: [33, 34]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L33
- sources:
  - layer_range: [33, 34]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L34
- sources:
  - layer_range: [34, 35]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L34
- sources:
  - layer_range: [34, 35]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L34
- sources:
  - layer_range: [34, 35]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L35
- sources:
  - layer_range: [35, 36]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L35
- sources:
  - layer_range: [35, 36]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L35
- sources:
  - layer_range: [35, 36]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L36
- sources:
  - layer_range: [36, 37]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L36
- sources:
  - layer_range: [36, 37]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L36
- sources:
  - layer_range: [36, 37]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L37
- sources:
  - layer_range: [37, 38]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L37
- sources:
  - layer_range: [37, 38]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L37
- sources:
  - layer_range: [37, 38]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L38
- sources:
  - layer_range: [38, 39]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L38
- sources:
  - layer_range: [38, 39]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L38
- sources:
  - layer_range: [38, 39]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L39
- sources:
  - layer_range: [39, 40]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L39
- sources:
  - layer_range: [39, 40]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L39
- sources:
  - layer_range: [39, 40]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Original L40
- sources:
  - layer_range: [40, 41]
    model: unsloth/Mistral-Small-Instruct-2409
# Dupe A of L40
- sources:
  - layer_range: [40, 41]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# Dupe B of L40
- sources:
  - layer_range: [40, 41]
    model: unsloth/Mistral-Small-Instruct-2409
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# ... REPEAT UNTIL 41
- sources:
  - layer_range: [41, 55]
    model: unsloth/Mistral-Small-Instruct-2409
Downloads last month
22
Safetensors
Model size
39B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for TheDrummer/MS-Interleaved-Upscale-39B

Finetuned
(9)
this model
Quantizations
2 models