Trying to quantize granite-3B

#1 by Interpause

(Screenshot: perplexity scores.)

FYI, in case you decide to quantize granite-3B: the RoPE settings are missing from the model config, but the paper mentions the model was trained with RoPE, though it never says at what scale. Judging from the ppl scores, it does seem it was trained with RoPE after all.
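
For anyone else hitting this, here's a minimal sketch of patching the missing RoPE settings into `config.json` before quantizing. The `rope_theta` of 10000 is just the usual Llama default and purely an assumption on my part, since the paper never states the actual value; the local path is hypothetical too.

```python
import json
from pathlib import Path

config_path = Path("granite-3b/config.json")  # hypothetical local model dir
config = json.loads(config_path.read_text())

# The paper says the model was trained with RoPE but the config omits the
# settings; 10000 is the common Llama default and only an assumption here.
config.setdefault("rope_theta", 10000.0)
config.setdefault("rope_scaling", None)  # no scaling unless you know otherwise

config_path.write_text(json.dumps(config, indent=2))
print("Patched RoPE settings:", config["rope_theta"], config["rope_scaling"])
```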

The 3B and 8B variants are just Llama models, architecturally. Only 20B and up are GPTBigCode. ExLlama should automatically detect it based on the architecture string in config.json.
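
If you want to check which loader path a checkpoint will take before quantizing, a quick sketch of inspecting the architecture string (the local path is again hypothetical):

```python
import json
from pathlib import Path

config = json.loads(Path("granite-3b/config.json").read_text())  # hypothetical path

# config.json lists the model class under "architectures"
arch = config.get("architectures", ["unknown"])[0]
if arch == "LlamaForCausalLM":
    print("Llama-style checkpoint (3B/8B granite)")
elif arch == "GPTBigCodeForCausalLM":
    print("GPTBigCode checkpoint (20B and up)")
else:
    print(f"Unrecognized architecture: {arch}")
```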
