lee12ki/llama2-finetune-7b

Text Generation

fine-tuned-model

efficient-training


lee12ki committed 2 days ago

Commit 3c28d6b (1 parent: 59d3697)

Update README.md

Files changed (1)

README.md +8 -0

README.md CHANGED

@@ -30,6 +30,14 @@ This modelcard aims to be a base template for new models. It has been generated
 <!-- Provide a longer summary of what this model is. -->
 - **Developed by:** lee12ki

 <!-- Provide a longer summary of what this model is. -->
+This model, lee12ki/llama2-finetune-7b, is a fine-tuned version of NousResearch/Llama-2-7b-chat-hf. It has been optimized for text-generation tasks, particularly for instruction-following and conversational applications. The fine-tuning was performed using the Guanaco-LLaMA2-1K dataset, a high-quality dataset designed for aligning language models to human preferences.
+The fine-tuning process used QLoRA (Quantized Low-Rank Adaptation), which enables memory-efficient training by keeping the base model in 4-bit precision. The LoRA configuration parameters are:
+- LoRA rank (r): 64
+- Alpha parameter: 16
+- Dropout probability: 0.1
 - **Developed by:** lee12ki
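
To make the LoRA parameters added in this commit more concrete, here is a minimal sketch of what they imply. It is not the author's training code: `LoraSettings` is a hypothetical container, and the 4096-wide square projection is an assumption based on the hidden size of Llama-2-7B. It relies only on the standard LoRA convention that the low-rank update is scaled by alpha / r. In the PEFT library these values would most likely correspond to `LoraConfig(r=64, lora_alpha=16, lora_dropout=0.1)`, though the commit does not show the actual configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container mirroring the values listed in the README."""

    r: int = 64           # LoRA rank
    alpha: int = 16       # alpha parameter
    dropout: float = 0.1  # dropout probability on the adapter input

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update B @ A by alpha / r.
        return self.alpha / self.r

    def adapter_params(self, d_in: int, d_out: int) -> int:
        # A has shape (r, d_in) and B has shape (d_out, r).
        return self.r * (d_in + d_out)


settings = LoraSettings()
hidden = 4096  # assumption: width of one square projection in Llama-2-7B

full = hidden * hidden                              # weights in the frozen matrix
adapter = settings.adapter_params(hidden, hidden)   # trainable adapter weights

print(f"scaling factor alpha / r: {settings.scaling}")
print(f"adapter parameters per projection: {adapter:,}")
print(f"fraction of the full matrix: {adapter / full:.5f}")
```

With rank 64 and alpha 16 the update is scaled by 0.25, and each adapted 4096 x 4096 projection trains 524,288 parameters, about 3% of the frozen matrix it sits beside.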