falcon-generate-560-smallset / tokenizer_config.json

Training in progress, step 10

74347f7 about 1 year ago

248 Bytes

	{
	"add_prefix_space": false,
	"clean_up_tokenization_spaces": true,
	"eos_token": "<\|endoftext\|>",
	"model_input_names": [
	"input_ids",
	"attention_mask"
	],
	"model_max_length": 2048,
	"tokenizer_class": "PreTrainedTokenizerFast"
	}