Compatible with transformers APIs?
1
#18 opened over 1 year ago
by
qiz
How to get this running on Oobabooga with RTX 4080 16GB?
8
#17 opened over 1 year ago
by
Goldenblood56
I'm getting 0.4 tokens/s on a 4090
2
#16 opened over 1 year ago
by
androtester
.pt version uses 2gb less VRAM for me than the non-groupsized .safetensors
3
#10 opened over 1 year ago
by
Monero