kaitchup/Maixtchup-4x7b-QLoRA-SFT-UltraChat

ee716b9 verified 10 months ago

	---
	library_name: transformers
	tags:
	- peft
	license: mit
	datasets:
	- HuggingFaceH4/ultrachat_200k
	language:
	- en
	---
	LoRA adapter for kaitchup/Maixtchup-4x7b, briefly fine-tuned with QLoRA (supervised fine-tuning) on UltraChat (HuggingFaceH4/ultrachat_200k).

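	To load the base model in 4-bit and attach this adapter:
	```
	import torch
	from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
	from peft import PeftModel

	model_name = "kaitchup/Maixtchup-4x7b"

	# Tokenizer of the base model
	tokenizer = AutoTokenizer.from_pretrained(model_name, use_fast=True)

	# 4-bit NF4 quantization with double quantization; matmuls run in float16
	compute_dtype = torch.float16
	bnb_config = BitsAndBytesConfig(
	    load_in_4bit=True,
	    bnb_4bit_quant_type="nf4",
	    bnb_4bit_compute_dtype=compute_dtype,
	    bnb_4bit_use_double_quant=True,
	)

	# attn_implementation="flash_attention_2" requires the flash-attn package
	# and a supported GPU; remove the argument to use the default attention.
	model = AutoModelForCausalLM.from_pretrained(
	    model_name,
	    quantization_config=bnb_config,
	    device_map="auto",
	    attn_implementation="flash_attention_2",
	)

	# Enable the KV cache for faster generation
	model.config.use_cache = True

	# Attach the LoRA adapter to the quantized base model
	model = PeftModel.from_pretrained(model, "kaitchup/Maixtchup-4x7b-QLoRA-SFT-UltraChat")
	```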
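
The snippet above only loads the model. A minimal sketch of how it might then be used for generation follows. The helper name `generate_reply`, the plain-string prompt, and greedy decoding are assumptions for illustration; this card does not state which prompt format was used during fine-tuning, so a chat-style format may give better results.

```python
def generate_reply(model, tokenizer, prompt, max_new_tokens=128):
    """Generate a continuation of `prompt` and return only the newly generated text.

    Hypothetical helper: `model` and `tokenizer` are expected to be the objects
    created by the loading code in this card.
    """
    # Tokenize the prompt and move the tensors to the model's device.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    prompt_len = inputs["input_ids"].shape[1]

    # Greedy decoding (an assumption; sampling parameters can be passed instead).
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)

    # Drop the prompt tokens so that only the model's reply is decoded.
    new_tokens = output_ids[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

With the model and tokenizer loaded as shown earlier, a call could look like `print(generate_reply(model, tokenizer, "Explain what a LoRA adapter is."))`.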