it just keeps generating <pad> tokens

#1
by NePe - opened

I have no idea why, but whatever I try it only generates pad tokens :(

Same thing here, and the same with everyone else's 4-bit quants too, even with the dev branch of transformers.
I don't think anybody is testing their quants before posting them.

Checked by downloading the full version and loading it with bnb 4-bit; it produces the same results. It seems like it's an HF/model issue, not this specific upload.

Use `torch_dtype=torch.bfloat16`:
https://huggingface.co/google/gemma-2-27b-it/discussions/16
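A minimal sketch of that fix, assuming a recent transformers + bitsandbytes install (the model id, prompt, and generation settings here are just examples): load the model in bfloat16 instead of float16, and keep the 4-bit compute dtype in bf16 as well. Per the linked discussion, running Gemma 2 in float16 is what makes generation collapse to `<pad>` tokens.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "google/gemma-2-27b-it"  # example checkpoint; adjust to the quant you are testing

# 4-bit load, but with bf16 compute so activations don't overflow as they can in fp16
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,  # the fix from the linked discussion
    device_map="auto",
)

# Quick sanity check: with bf16 this should produce real text, not a stream of <pad>
inputs = tokenizer("Write a haiku about quantization.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```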
