请问能否提供EXL2量化版本?

by Orion-zhen - opened May 19

May 19

EXL2相较于GPTQ和AWQ, 有较低的模型和上下文显存占用, 更适合在消费端部署. 请问是否能提供EXL2量化版本? 例如4.0bpw

Jun 3

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment