Has this been converted with the recent llama.cpp patches applied?

#1
by FlareRebellion - opened

I'm talking about: https://github.com/ggerganov/llama.cpp/pull/8676

Without this patch, coherence seems to break down at larger context sizes.

NeverSleep org

Yes!