This is a test model because the previous attempt failed.
Prompt format is: ChatML
Merged model: mpasila/Viking-SlimSonnet-v0.2-7B
Trained with regular LoRA (not quantized/QLoRA) and LoRA rank was 128 and Alpha set to 32. Trained for 5000 steps (0.11 epoch).
Uploaded model
- Developed by: mpasila
- License: apache-2.0
- Finetuned from model : LumiOpen/Viking-7B
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 7
Model tree for mpasila/Viking-SlimSonnet-v0.2-LoRA-7B
Base model
LumiOpen/Viking-7B