Does this work with Ollama?
Got the error "invalid file magic".
Corrupted download, maybe? The magic number is correct for me.
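If you want to double-check the file you downloaded yourself, here's a quick sketch (assuming Python 3 and that the file is meant to be GGUF, whose header starts with the 4-byte magic "GGUF" followed by a little-endian uint32 version) that prints what's actually at the start of the file:

```python
import struct
import sys

# Usage: python check_gguf.py path/to/model.gguf
path = sys.argv[1]

with open(path, "rb") as f:
    magic = f.read(4)                            # should be b"GGUF" for a valid GGUF file
    version = struct.unpack("<I", f.read(4))[0]  # GGUF version, little-endian uint32

print(f"magic={magic!r} version={version}")
if magic != b"GGUF":
    print("File does not start with the GGUF magic -- likely truncated or not a GGUF model")
```

If the magic comes back wrong after a fresh download, the problem is the file itself rather than your llama.cpp build.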
Redownloaded and tried again. Got the same error :(
Hmm. I’m sorry, I have no idea. As you can see in the README it works just fine for me, and I’ve had no other complaints. Maybe make sure you’re on the latest llama.cpp version?
getting "GGML_ASSERT: ggml-metal.m:1540: false && "MUL MAT-MAT not implemented"" crash with latest compiled llama.cpp on M3 Max
getting "GGML_ASSERT: ggml-metal.m:1540: false && "MUL MAT-MAT not implemented"" crash with latest compiled llama.cpp on M3 Max
@vidumec Retry with batch size >= 16 for the time being. bfloat16 support is still being worked on
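For illustration, if you happen to be loading the model through the llama-cpp-python bindings instead of the compiled CLI, the batch size is the `n_batch` parameter. This is just a sketch (paths and prompt are placeholders, and I haven't tested it on an M3 Max):

```python
from llama_cpp import Llama

# Placeholder model path; adjust to your setup.
llm = Llama(
    model_path="./model.gguf",
    n_batch=16,  # suggested workaround: keep the batch size >= 16
)

out = llm("Hello", max_tokens=16)
print(out["choices"][0]["text"])
```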
That didn't help, but looking at the code I just realized it hasn't been implemented for Metal, so it can only work on the CPU.
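If it really is CPU-only for now, you can bypass the Metal backend by offloading zero layers to the GPU; with the CLI that's `-ngl 0` / `--n-gpu-layers 0`, and with the llama-cpp-python bindings (again only a sketch with placeholder paths) it would look like:

```python
from llama_cpp import Llama

# n_gpu_layers=0 keeps every layer on the CPU, so the Metal kernels are never hit.
llm = Llama(model_path="./model.gguf", n_gpu_layers=0, n_batch=16)
print(llm("Hello", max_tokens=16)["choices"][0]["text"])
```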