Fix chat template incompatibility with ConversationalPipeline
#42 opened by hiyouga
Before this PR:
from transformers import pipeline, Conversation
chatbot = pipeline("conversational", model="google/gemma-7b-it")
conversation = Conversation("who are you")
conversation = chatbot(conversation)
conversation.messages[-1]["content"]
# I am a digital entity, and I am currently residing within the digital realm. I am not human, I am a artificial entity. I am here to serve you and to fulfill your requests.
After this PR:
from transformers import pipeline, Conversation
chatbot = pipeline("conversational", model="google/gemma-7b-it")
conversation = Conversation("who are you")
conversation = chatbot(conversation)
conversation.messages[-1]["content"]
# I am a large language model, trained by Google. I am here to help you with your questions and provide you with information. I am still under development, but I am constantly learning new things.
There are two ways to convert text inputs to token IDs:
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")
chat = [{"role": "user", "content": "hi"}]
prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
prompt = tokenizer.encode(prompt, add_special_tokens=True)  # encode() prepends the BOS token (id 2)
# [2, 106, 1645, 108, 544, 107, 108, 106, 2516, 108]
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")
chat = [{"role": "user", "content": "hi"}]
prompt = tokenizer.apply_chat_template(chat, tokenize=True, add_generation_prompt=True)  # no BOS token is added here
# [106, 1645, 108, 544, 107, 108, 106, 2516, 108]
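The only difference between the two results is the leading BOS token (id 2). A quick check makes the discrepancy explicit (a hypothetical snippet, not part of the PR, assuming the original pre-fix template):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")
chat = [{"role": "user", "content": "hi"}]

text = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
ids_via_encode = tokenizer.encode(text, add_special_tokens=True)
ids_direct = tokenizer.apply_chat_template(chat, tokenize=True, add_generation_prompt=True)

print(ids_via_encode[0] == tokenizer.bos_token_id)  # True: encode() prepends BOS
print(ids_via_encode[1:] == ids_direct)             # True: the direct path drops BOS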
However, transformers.ConversationalPipeline
adopts the latter approach, resulting in wrong inputs (the BOS token is missing):
# Excerpt from ConversationalPipeline.preprocess: apply_chat_template is called
# with tokenize=True (the default), so no BOS token is prepended.
def preprocess(self, conversation: Conversation, min_length_for_response=32) -> Dict[str, Any]:
    input_ids = self.tokenizer.apply_chat_template(conversation, add_generation_prompt=True)
    if self.framework == "pt":
        input_ids = torch.LongTensor([input_ids])
    elif self.framework == "tf":
        input_ids = tf.constant([input_ids])
    return {"input_ids": input_ids, "conversation": conversation}
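The fix is therefore to make the chat template in tokenizer_config.json emit the BOS token itself, so that both code paths produce the same ids. A simplified sketch of the idea (the real Gemma template also maps the assistant role to "model" and validates role alternation; this version is illustrative only):

# Illustrative Jinja chat template: '{{ bos_token }}' at the start means
# apply_chat_template(tokenize=True) now includes the BOS id itself.
tokenizer.chat_template = (
    "{{ bos_token }}"
    "{% for message in messages %}"
    "{{ '<start_of_turn>' + message['role'] + '\\n' + message['content'] | trim + '<end_of_turn>\\n' }}"
    "{% endfor %}"
    "{% if add_generation_prompt %}{{ '<start_of_turn>model\\n' }}{% endif %}"
)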
hiyouga changed pull request title from "Update tokenizer_config.json" to "Fix chat template incompatibility with ConversationalPipeline"
Sounds fair @Rocketknight1!
Yes, looks like a better solution if it provides better compatibility with the conversational pipeline! Will merge soon!
Yes, this is correct! Our chat templates should contain all the special tokens needed, rather than depending on the tokenizer to add them afterwards.
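To illustrate the principle: once the template owns the special tokens, both entry points agree (a hypothetical check; note that add_special_tokens must now be False when re-encoding the templated text, or the BOS token would be duplicated):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b-it")  # with the updated template
chat = [{"role": "user", "content": "hi"}]

text = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
ids_via_encode = tokenizer.encode(text, add_special_tokens=False)  # template already emits <bos>
ids_direct = tokenizer.apply_chat_template(chat, tokenize=True, add_generation_prompt=True)
assert ids_via_encode == ids_direct  # both now start with the BOS id (2)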
pcuenq changed pull request status to merged