Llama-2-70B-GGUF-tokenizer-legacy
Tokenizer for llama-2-70b
This repository contains the following files: special_tokens_map.json, tokenizer_config.json, tokenizer.json, and tokenizer.model. These files are used to load a llama.cpp model as a HuggingFace Transformers model using llamacpp_HF loader.
Note: converted using convert_llama_weights_to_hf.py with legacy method.
How to use with oobabooga/text-generation-webui
Download a .gguf file from TheBloke/Llama-2-70B-GGUF based on your preferred quantization method;
Place your .gguf in a subfolder of models/ along with these 4 files.