philschmid/gemma-tokenizer-chatml

philschmid (HF staff) committed on Feb 29

Commit f6230c8 (1 parent: 45011b4)

Update README.md

Files changed (1)

README.md +4 -1

@@ -9,6 +9,9 @@ This repository includes a fast tokenizer for [google/gemma-7b](https://huggingf
 No new tokens were added during that process to ensure that the original model's embedding doesn't need to be modified.
 ```python
 from transformers import AutoTokenizer
@@ -21,7 +24,7 @@ messages = [
 ]
 chatml = tokenizer.apply_chat_template(messages, add_generation_prompt=False, tokenize=False)
 # <bos><|im_start|>system
 # You are Gemma.<|im_end|>
 # <|im_start|>user

 No new tokens were added during that process to ensure that the original model's embedding doesn't need to be modified.
+_Note: This tokenizer is not 100% ChatML compliant, because [google/gemma-7b](https://huggingface.co/google/gemma-7b) appears to always require the original `<bos>` token to be part of the input. The chat template is therefore `<bos>` + `chatml` + `<eos>`._
 ```python
 from transformers import AutoTokenizer
 ]
 chatml = tokenizer.apply_chat_template(messages, add_generation_prompt=False, tokenize=False)
+print(chatml)
 # <bos><|im_start|>system
 # You are Gemma.<|im_end|>
 # <|im_start|>user
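
The `<bos>` + `chatml` + `<eos>` layout described in the added note can be illustrated without downloading the tokenizer. The following is a minimal sketch, not the repository's actual chat template: `render_chatml` is a hypothetical helper, the user message is invented for illustration, and placing `<eos>` at the very end of a finished conversation is an assumption based only on the wording of the note.

```python
# Hypothetical, dependency-free sketch of the layout described in the note.
# It does NOT call the real tokenizer or reproduce its Jinja chat template.
BOS, EOS = "<bos>", "<eos>"
IM_START, IM_END = "<|im_start|>", "<|im_end|>"


def render_chatml(messages, add_generation_prompt=False):
    """Render messages as ChatML turns, prefixed with Gemma's <bos> token."""
    body = "".join(
        f"{IM_START}{m['role']}\n{m['content']}{IM_END}\n" for m in messages
    )
    if add_generation_prompt:
        # Open an assistant turn for the model to complete; no <eos> yet.
        return f"{BOS}{body}{IM_START}assistant\n"
    # Assumption: <eos> closes a finished conversation.
    return f"{BOS}{body}{EOS}"


messages = [
    {"role": "system", "content": "You are Gemma."},
    # The user message below is made up; the diff elides the original one.
    {"role": "user", "content": "Hello, how are you?"},
]
rendered = render_chatml(messages)
print(rendered)
```

The leading `<bos>` is the only deviation from plain ChatML that the note calls out; the output of the real `tokenizer.apply_chat_template` call may differ in where `<eos>` appears.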