---
base_model: []
library_name: transformers
tags:
- mergekit
- merge
---

# L3-Stheno-v3.2-12.2B-Instruct - Float32

For the full model card and GGUFs, please go to:

[ https://huggingface.co/DavidAU/L3-Stheno-v3.2-12.2B-INSTRUCT-ULTRA-F32-GGUF ]

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the passthrough merge method.

### Models Merged

The following models were included in the merge:
* G:/7B/L3-8B-Stheno-v3.2
* G:/7B/Meta-Llama-3-8B-Instruct

### Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
- sources:
  - model: G:/7B/Meta-Llama-3-8B-Instruct
    layer_range: [0, 12]
- sources:
  - model: G:/7B/L3-8B-Stheno-v3.2
    layer_range: [6, 19]
    parameters:
      scale:
      - filter: o_proj
        value: 1
      - filter: down_proj
        value: 1
      - value: 1
- sources:
  - model: G:/7B/Meta-Llama-3-8B-Instruct
    layer_range: [12, 18]
    parameters:
      scale:
      - filter: o_proj
        value: .5
      - filter: down_proj
        value: .5
      - value: 1
- sources:
  - model: G:/7B/Meta-Llama-3-8B-Instruct
    layer_range: [18, 25]
    parameters:
      scale:
      - filter: o_proj
        value: .75
      - filter: down_proj
        value: .75
      - value: 1
- sources:
  - model: G:/7B/L3-8B-Stheno-v3.2
    layer_range: [19, 32]
    parameters:
      scale:
      - filter: o_proj
        value: 1
      - filter: down_proj
        value: 1
      - value: 1
merge_method: passthrough
dtype: float32
```
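
As a rough sanity check on the "12.2B" in the model name, the slices in the configuration above can be tallied in a few lines of Python. This is only a sketch: the layer ranges are copied from the YAML and read as half-open `[start, end)` intervals, while the architecture dimensions (hidden size, MLP width, KV width, vocabulary size, untied output head) are assumptions taken from the public Llama 3 8B configuration, not from this card.

```python
# Layer ranges copied from the configuration above.
# mergekit reads layer_range as half-open: [start, end).
slices = [
    ("Meta-Llama-3-8B-Instruct", 0, 12),
    ("L3-8B-Stheno-v3.2", 6, 19),
    ("Meta-Llama-3-8B-Instruct", 12, 18),
    ("Meta-Llama-3-8B-Instruct", 18, 25),
    ("L3-8B-Stheno-v3.2", 19, 32),
]

# Passthrough stacks the slices in order, so the depth is the sum of their lengths.
total_layers = sum(end - start for _, start, end in slices)

# ASSUMPTION: Llama 3 8B dimensions; none of these are stated in this card.
HIDDEN = 4096          # model width
INTERMEDIATE = 14336   # MLP width
KV_DIM = 1024          # 8 KV heads x 128 head dim (grouped-query attention)
VOCAB = 128256         # vocabulary size

attn = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM  # q_proj, o_proj, k_proj, v_proj
mlp = 3 * HIDDEN * INTERMEDIATE                   # gate_proj, up_proj, down_proj
norms = 2 * HIDDEN                                # two RMSNorm weights per layer
per_layer = attn + mlp + norms

# Input embeddings, untied output head, and the final norm.
embeddings = 2 * VOCAB * HIDDEN + HIDDEN

total_params = total_layers * per_layer + embeddings

print(f"layers: {total_layers}")
print(f"parameters: {total_params / 1e9:.2f}B")
```

Under these assumptions the stack comes to 51 decoder layers (versus 32 in each source model) and roughly 12.2 billion parameters, which is consistent with the model name. The `scale` entries do not change the parameter count; they only multiply the matching `o_proj` and `down_proj` weights in the affected slices.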