Will write a proper model card later. It's Mistral 7B v0.1 finetuned on MaziyarPanahi/magpie-ultra-v0.1-sharegpt for one epoch. GaLore full training. ChatML prompt format. It needs non-zero rep_p sampling, around 1.05. Took around 16 hours on 3090 Ti.