ParasiticRogue/Lumimaid-Magnum-12B-exl2-8.0

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision


Quant Details

EXL2 quant of Undi's merge, made using the Nemo parquet from Bluemoon-Light. In my testing with Nemo-based models, an 8-bit cache seems more stable than a 4-bit one.
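To put rough numbers on the memory side of the cache choice, here is a minimal sketch that estimates the key/value cache size at different cache precisions. The layer count, KV head count, and head dimension are assumptions based on the published Mistral Nemo configuration, so check them against the model's config.json; the function name is hypothetical, and the estimate ignores any per-block scaling overhead a quantized cache adds. It says nothing about output stability, only about footprint.

```python
def kv_cache_bytes(ctx_len, bits, n_layers=40, n_kv_heads=8, head_dim=128):
    """Estimate KV cache size in bytes for one sequence.

    Defaults are ASSUMED Mistral Nemo values, not taken from this card.
    The factor of 2 accounts for storing both keys and values.
    """
    elements = 2 * n_layers * n_kv_heads * head_dim * ctx_len
    return elements * bits // 8


if __name__ == "__main__":
    ctx = 16384  # the 16k context mentioned in the original model card
    for bits in (16, 8, 4):
        gib = kv_cache_bytes(ctx, bits) / 2**30
        print(f"{bits:>2}-bit cache at {ctx} tokens: {gib:.3f} GiB")
```

Under these assumptions, a 16k-token cache takes about 2.5 GiB at 16-bit, 1.25 GiB at 8-bit, and 0.625 GiB at 4-bit, so the 8-bit cache costs roughly 0.6 GiB more than 4-bit at that context length.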

https://huggingface.co/Undi95/Lumimaid-Magnum-12B

https://huggingface.co/datasets/ParasiticRogue/Bluemoon-Light

Original Model Card Information

Merge of Lumimaid and Magnum as requested by some.

I used the new DELLA merge method in mergekit and added to the mix a finetune of Nemo trained only on Claude input, with a 16k context.

Prompt template: Mistral

<s>[INST] {input} [/INST] {output}</s>
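The template above can be filled in by a small helper. This is a minimal sketch, not the author's tooling: the function name is hypothetical, and the multi-turn layout (repeating the [INST] block after each closed reply) is an assumption based on the usual Mistral convention. Many tokenizers add the <s> token themselves, so pass bos="" if yours does.

```python
def format_mistral_prompt(turns, bos="<s>", eos="</s>"):
    """Build a prompt string following the Mistral template on this card.

    turns: list of (user_input, model_output) pairs. Use None as the
    output of the last pair to leave the prompt open for generation.
    """
    parts = [bos]
    for user_input, model_output in turns:
        parts.append(f"[INST] {user_input} [/INST]")
        if model_output is not None:
            # A completed reply is closed with the end-of-sequence token.
            parts.append(f" {model_output}{eos}")
    return "".join(parts)


if __name__ == "__main__":
    history = [("Hello there.", "Hi! How can I help?"), ("Tell me a story.", None)]
    print(format_mistral_prompt(history))
```

Leaving the final output as None produces a prompt that ends in [/INST], which is where the model is expected to continue.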



Model tree for ParasiticRogue/Lumimaid-Magnum-12B-exl2-8.0

NeverSleep/Lumimaid-v0.2-12B

Undi95/LocalC-12B-e2.0

intervitens/mini-magnum-12b-v1.1

mistralai/Mistral-Nemo-Instruct-2407

Merge model

this model