llama_model_load: error loading model: invalid split file

#1
by Stefanvarunix - opened

Hi,
I tried to merge the files with
cat dbrx-16x12b-instruct-q6_k-* > dbrx-16x12b-instruct-q6_k.gguf

When running
./server -m ./dbrx-16x12b-instruct-q6_k.gguf
I get the error message
"llama_model_load: error loading model: invalid split file:"

My mistake?

llama.cpp now has built-in support for sharded models, so do not concatenate the split files with a binary cat. You can either merge them with the gguf-split tool or simply pass the first split to llama_model_loader and it will pick up the rest.
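A minimal sketch of both options, assuming the shards follow llama.cpp's `-NNNNN-of-NNNNN.gguf` split naming (the split count here is illustrative; adjust to your actual filenames):

```shell
# Option 1: load the shards directly — pass only the first split;
# the loader detects and opens the remaining splits itself
./server -m ./dbrx-16x12b-instruct-q6_k-00001-of-00010.gguf

# Option 2: merge the shards into a single GGUF with gguf-split,
# then load the merged file as usual
./gguf-split --merge \
    ./dbrx-16x12b-instruct-q6_k-00001-of-00010.gguf \
    ./dbrx-16x12b-instruct-q6_k.gguf
./server -m ./dbrx-16x12b-instruct-q6_k.gguf
```

Either way, the split headers stay intact, which is what a raw `cat` destroys: each shard carries its own GGUF metadata, so byte-concatenation produces a file the loader rejects as an invalid split.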

phymbert changed discussion status to closed
