VAGO solutions quants
Collection
Quantized version for the excellent german speaking models created by VAGO solutions.
•
6 items
•
Updated
•
2
This model was converted to MLX format from VAGOsolutions/SauerkrautLM-7b-LaserChat
.
Refer to the original model card for more details on the model.
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("mayflowergmbh/SauerkrautLM-7b-LaserChat-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)