Spaces:

goavinash5
/

Gradio_LLAMA_Testing

Sleeping

App Files Files Community

Gradio_LLAMA_Testing / env_examples /.env.7b_8bit_example

goavinash5's picture

Upload folder using huggingface_hub

e97665c 12 months ago

history blame contribute delete

801 Bytes

	MODEL_PATH = "./models/Llama-2-7b-chat-hf"

	# options: llama.cpp, gptq, transformers
	BACKEND_TYPE = "transformers"

	# only for transformers bitsandbytes 8 bit
	LOAD_IN_8BIT = True

	MAX_MAX_NEW_TOKENS = 2048
	DEFAULT_MAX_NEW_TOKENS = 1024
	MAX_INPUT_TOKEN_LENGTH = 4000

	DEFAULT_SYSTEM_PROMPT = "You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature. If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information."