---
datasets:
- xzuyn/lima-multiturn-alpaca
language:
- en
---

![](https://huggingface.co/xzuyn/LLaMa-2-LIMA-7B-QLoRA_v2/resolve/main/visual.png)

Trained on a single AMD Radeon RX 7900 XTX (24 GB).

[Zeus-LLM-Trainer](https://github.com/official-elinas/zeus-llm-trainer) command to recreate:

```sh
python finetune.py \
  --data_path "xzuyn/lima-multiturn-alpaca" \
  --learning_rate 0.0001 \
  --optim "paged_adamw_8bit" \
  --train_4bit \
  --lora_r 32 \
  --lora_alpha 32 \
  --prompt_template_name "alpaca_short" \
  --num_train_epochs 15 \
  --gradient_accumulation_steps 24 \
  --per_device_train_batch_size 1 \
  --logging_steps 1 \
  --save_total_limit 20 \
  --use_gradient_checkpointing True \
  --save_and_eval_steps 1 \
  --group_by_length True \
  --cutoff_len 4096 \
  --val_set_size 0 \
  --use_flash_attn True \
  --base_model "meta-llama/Llama-2-7b-hf"
```
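
With `--per_device_train_batch_size 1` and `--gradient_accumulation_steps 24`, the effective batch size is 24 sequences per optimizer step; accumulation keeps peak activation memory at the single-sequence level, which is what makes the 4096-token cutoff workable on one 24 GB card.

To try the result, here is a minimal inference sketch using `transformers` and `peft`. It assumes the LoRA adapter weights are published in this repo (`xzuyn/LLaMa-2-LIMA-7B-QLoRA_v2`) and that `alpaca_short` reduces to the bare `### Instruction:` / `### Response:` Alpaca format; check the trainer's template file if outputs look malformed.

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_ID = "meta-llama/Llama-2-7b-hf"
ADAPTER_ID = "xzuyn/LLaMa-2-LIMA-7B-QLoRA_v2"  # assumption: adapter files live in this repo

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
model = AutoModelForCausalLM.from_pretrained(
    BASE_ID, torch_dtype=torch.float16, device_map="auto"
)
# Attach the QLoRA adapter on top of the frozen base weights.
model = PeftModel.from_pretrained(model, ADAPTER_ID)
model.eval()

# Assumed "alpaca_short" prompt shape (instruction-only, no input field).
prompt = (
    "### Instruction:\n"
    "Explain what LIMA-style finetuning is in two sentences.\n\n"
    "### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
# Strip the prompt tokens so only the model's reply is printed.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```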