Thank you for the contribution. I have a problem when using the quantized model, I load it using pipeline but it generates an error like this, do you know how to solve it?
I solved it by:
tokenizer.model_input_names = ['input_ids', 'attention_mask']
· Sign up or log in to comment