Text Generation
Transformers
Safetensors
llama
text-generation-inference
4-bit precision
awq
File size: 90 Bytes
0bc6466
 
 
 
 
 
1
2
3
4
5
6
{
    "zero_point": true,
    "q_group_size": 128,
    "w_bit": 4,
    "version": "GEMM"
}