albert-base-sanskrit / tokenizer_config.json
Update tokenizer_config.json
68744c5
218 Bytes
{"max_len": 512, "bos_token": "[CLS]", "eos_token": "[SEP]", "unk_token": "<unk>", "sep_token": "[SEP]", "pad_token": "<pad>", "cls_token": "[CLS]", "mask_token": "[MASK]", "do_lower_case": false, "keep_accents": true}
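A minimal sketch of how the settings above could be inspected, using only the Python standard library. The JSON string is copied from the file; the helper name `summarize_tokenizer_config` and the split into "special tokens" versus "other options" are illustrative assumptions, not part of any library API.

```python
import json

# Contents of tokenizer_config.json, copied verbatim from above.
RAW_CONFIG = (
    '{"max_len": 512, "bos_token": "[CLS]", "eos_token": "[SEP]", '
    '"unk_token": "<unk>", "sep_token": "[SEP]", "pad_token": "<pad>", '
    '"cls_token": "[CLS]", "mask_token": "[MASK]", '
    '"do_lower_case": false, "keep_accents": true}'
)


def summarize_tokenizer_config(raw):
    """Hypothetical helper: split the config into special tokens and other options."""
    cfg = json.loads(raw)
    # Keys ending in "_token" name special tokens; everything else is an option.
    special = {k: v for k, v in cfg.items() if k.endswith("_token")}
    options = {k: v for k, v in cfg.items() if not k.endswith("_token")}
    return special, options


special_tokens, options = summarize_tokenizer_config(RAW_CONFIG)

# Several roles share one string: bos/cls both map to "[CLS]", eos/sep to "[SEP]".
distinct = sorted(set(special_tokens.values()))

print("special tokens:", special_tokens)
print("distinct token strings:", distinct)
print("options:", options)
```

The config defines seven special-token roles but only five distinct token strings, since `bos_token` reuses `[CLS]` and `eos_token` reuses `[SEP]`. The remaining options cap the sequence length at 512 and leave case and accents untouched. Loading the file through a tokenizer library is not shown here.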