from transformers import AutoTokenizer, PreTrainedTokenizerFast

from hf_olmo.configuration_olmo import OLMoConfig


class OLMoTokenizerFast(PreTrainedTokenizerFast):
    # Note: OLMo's tokenizer is already a wrapper around huggingface. This is potentially unnecessary.
    pass

    # def save_vocabulary(self, save_directory: str, filename_prefix: Optional[str] = None) -> Tuple[str]:
    #     # This is required to make the implementation complete.
    #     pass


# Register the tokenizer class so that it is available for transformer pipelines, auto-loading etc.
AutoTokenizer.register(OLMoConfig, fast_tokenizer_class=OLMoTokenizerFast)
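

# Minimal usage sketch: once this module has been imported, the registration above lets
# AutoTokenizer resolve a checkpoint whose config class is OLMoConfig to OLMoTokenizerFast.
# The path below is a hypothetical placeholder for a locally converted OLMo checkpoint
# directory containing the config and tokenizer files; substitute a real path to try it.
if __name__ == "__main__":
    tokenizer = AutoTokenizer.from_pretrained("path/to/converted-olmo-checkpoint")
    print(tokenizer("Hello, OLMo!")["input_ids"])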