Best q8 conversion down from bf16 with slightly better perplexity than f16 based quants
bc0fa51
verified
nisten
commited on