Is the KV cache of these models unusually high?
1
#6 opened 6 months ago
by
Hugsanir
prompt eval too slow
2
#4 opened 7 months ago
by
lfjmgs
can you guys share the size & perlexity tables thanks
1
#3 opened 7 months ago
by
habout632
About q4_k and q5_k
1
#2 opened 8 months ago
by
stduhpf
Cannot load model due to invalid format
2
#1 opened 8 months ago
by
ABX-AI