Is there any guidance how to use these models, eg. 2.5bit Llama?
Thank you
You would use them with ExLlamaV2. There's also support in text-generation-webui.
· Sign up or log in to comment