Quantization made by Richard Erkhov.

[Github](https://github.com/RichardErkhov)

[Discord](https://discord.gg/pvy7H8DZMG)

[Request more models](https://github.com/RichardErkhov/quant_request)

Llama-2-7b-WikiChat-fused - bnb 4bits
- Model creator: https://huggingface.co/stanford-oval/
- Original model: https://huggingface.co/stanford-oval/Llama-2-7b-WikiChat-fused/

Original model description:

---
license: llama2
language:
- en
---

This model is a fine-tuned LLaMA-2 (7B) model. Please accept the [LLaMA-2 license agreement](https://ai.meta.com/resources/models-and-libraries/llama-downloads/) before downloading this model.

Refer to the following for more information:

GitHub repository: https://github.com/stanford-oval/WikiChat

Paper: https://aclanthology.org/2023.findings-emnlp.157/

WikiChat

Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

Online demo: https://wikichat.genie.stanford.edu

WikiChat Pipeline