---
base_model:
- mistralai/Mistral-Nemo-Base-2407
- mistralai/Mistral-Large-Instruct-2407
- mistralai/Codestral-22B-v0.1
- mistralai/Mathstral-7B-v0.1
- nvidia/Mistral-NeMo-Minitron-8B-Instruct
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the passthrough merge method, which copies the selected layer slices from each source model directly into the output model without interpolating or averaging weights.

### Models Merged

The following models were included in the merge:

* [mistralai/Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407)
* [mistralai/Mistral-Large-Instruct-2407](https://huggingface.co/mistralai/Mistral-Large-Instruct-2407)
* [mistralai/Codestral-22B-v0.1](https://huggingface.co/mistralai/Codestral-22B-v0.1)
* [mistralai/Mathstral-7B-v0.1](https://huggingface.co/mistralai/Mathstral-7B-v0.1)
* [nvidia/Mistral-NeMo-Minitron-8B-Instruct](https://huggingface.co/nvidia/Mistral-NeMo-Minitron-8B-Instruct)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
dtype: float16
merge_method: passthrough
slices:
- sources:
  - layer_range: [16, 32]
    model: mistralai/Mistral-Large-Instruct-2407
- sources:
  - layer_range: [20, 32]
    model: nvidia/Mistral-NeMo-Minitron-8B-Instruct
- sources:
  - layer_range: [24, 32]
    model: mistralai/Mistral-Nemo-Base-2407
- sources:
  - layer_range: [28, 32]
    model: mistralai/Codestral-22B-v0.1
- sources:
  - layer_range: [32, 32]
    model: mistralai/Mathstral-7B-v0.1
```
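To reproduce the merge, save the configuration above to a file and either run the `mergekit-yaml` CLI (`mergekit-yaml config.yaml ./merged-model`) or call mergekit from Python. The sketch below follows the entry points shown in mergekit's README; `config.yaml` and `./merged-model` are placeholder paths, and the exact `MergeOptions` fields may differ between mergekit versions.

```python
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Load the YAML configuration shown above (placeholder path).
with open("config.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

# Execute the passthrough merge and write the result to disk.
run_merge(
    merge_config,
    out_path="./merged-model",
    options=MergeOptions(cuda=False, copy_tokenizer=True),
)
```

Note that mergekit layer ranges are end-exclusive, so the final slice's `layer_range: [32, 32]` appears to select an empty span: as written, no layers from Mathstral-7B-v0.1 would be carried into the output, even though it is listed as a source model.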
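Once published, the merged model loads like any other causal language model checkpoint. A minimal usage sketch with Transformers follows; the repository id `your-username/merge` is a placeholder for wherever this merge is hosted, and the prompt and generation settings are illustrative only.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repo id: substitute the actual location of this merge.
model_id = "your-username/merge"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # weights were merged in float16 (see `dtype` above)
    device_map="auto",   # requires the `accelerate` package
)

inputs = tokenizer("The passthrough merge method", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```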