yaystevek
/

llama-3-8b-Instruct-OpenHermes-2.5-QLoRA

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

llama-3-8b-Instruct-OpenHermes-2.5-QLoRA / README.md

yaystevek's picture

Update README.md

c54f859 verified 7 months ago

|

history blame contribute delete

681 Bytes

	---
	language:
	- en
	datasets:
	- teknium/OpenHermes-2.5
	license: other
	license_name: llama3
	base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
	tags:
	- unsloth
	- facebook
	- meta
	- pytorch
	- llama
	- llama-3
	- trl
	---

	# QLoRA Finetune Llama 3 Instruct 8B + OpenHermes 2.5

	This model is based on Llama-3-8b, and is governed by [META LLAMA 3 COMMUNITY LICENSE AGREEMENT](LICENSE)

	Llama 3 Instruct 8B 4-bit from unsloth, finetuned with the OpenHermes 2.5 dataset on my home PC on one 24GB 4090.

	Special care was taken to preserve and reinforce proper eos token structure.

	[GGUF (f16 and Q4_K_M) available](https://huggingface.co/yaystevek/llama-3-8b-Instruct-OpenHermes-2.5-QLoRA-GGUF)