We fine-tuned an LLM from tabtoyou/KoLLaVA-v1.5-Synatra-7b (Mistral-based).
We used LoRA (r=128, alpha=256) with lr=2e-5 and mm_projector_lr=0.
Training used CCTV image data (with bounding boxes) and ran for 100 epochs.
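
For reference, the hyperparameters listed here can be collected in one place. This is only a minimal sketch using the Python standard library, not the actual training script: the class name `FinetuneConfig` and its field names are hypothetical, and only the values come from this card. It also shows the scaling factor alpha / r that standard LoRA applies to the low-rank update. A projector learning rate of 0 presumably means the multimodal projector weights were left unchanged during training.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hypothetical container for the hyperparameters listed in this card."""

    base_model: str = "tabtoyou/KoLLaVA-v1.5-Synatra-7b"
    lora_r: int = 128              # LoRA rank
    lora_alpha: int = 256          # LoRA alpha
    learning_rate: float = 2e-5    # lr for the LoRA parameters
    mm_projector_lr: float = 0.0   # lr for the multimodal projector
    num_train_epochs: int = 100

    @property
    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_r


config = FinetuneConfig()
print(config.lora_scaling)  # 2.0
```

With r=128 and alpha=256 the low-rank update is scaled by 2.0.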
We are building a multimodal LLM for Kolon!