wcde/llama-30b-3bit-gr128

Text Generation


Generated with: --wbits 3 --groupsize 128 --true-sequential --new-eval --faster-kernel
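
For intuition about what `--wbits 3` and `--groupsize 128` control, here is a minimal sketch of group-wise 3-bit quantization. It uses plain round-to-nearest, not the GPTQ procedure that produced this checkpoint, and the function names are hypothetical, not part of any quantization tool.

```python
import random


def quantize_groupwise(weights, wbits=3, groupsize=128):
    """Round-to-nearest quantization with one (scale, minimum) pair per group.

    Illustrative only: GPTQ picks the codes with an error-compensating
    procedure, while this sketch rounds each weight independently.
    """
    levels = (1 << wbits) - 1  # highest integer code, 7 for 3 bits
    groups = []
    for start in range(0, len(weights), groupsize):
        group = weights[start:start + groupsize]
        lo, hi = min(group), max(group)
        scale = (hi - lo) / levels or 1.0  # fall back to 1.0 for a constant group
        codes = [min(levels, max(0, round((w - lo) / scale))) for w in group]
        groups.append((codes, scale, lo))
    return groups


def dequantize_groupwise(groups):
    """Map the integer codes back to approximate floating-point weights."""
    return [lo + code * scale for codes, scale, lo in groups for code in codes]


# Assumed toy data: 512 random weights, so 4 groups of 128.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(512)]
groups = quantize_groupwise(weights)
restored = dequantize_groupwise(groups)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each group of 128 weights shares its own scale and minimum, so the rounding error of a weight is bounded by half of its group's scale rather than by the range of the whole tensor.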