GGUF Please
#1 opened by HR1777
llama.cpp support for Mamba is coming soon; see https://github.com/ggerganov/llama.cpp/pull/5328. Converting requires adding at least "architectures": ["MambaForCausalLM"], to config.json, though.
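For reference, the addition would make config.json look roughly like this (a minimal sketch; the "model_type" field shown is an illustrative assumption, not taken from the model's actual config, which contains other fields as well):

```json
{
  "architectures": ["MambaForCausalLM"],
  "model_type": "mamba"
}
```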
The PR got merged.
@jondurbin
Please add the missing "architectures": ["MambaForCausalLM"], line to config.json, so that the model can be quantized with llama.cpp without any further manipulation.