wcde/llama-30b-3bit-gr128
Generated with: --wbits 3 --groupsize 128 --true-sequential --new-eval --faster-kernel
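
As a rough illustration of what these settings imply, here is a minimal sketch that parses the flag string and estimates the effective storage per weight. The parser is hypothetical and only mirrors the flag names listed above; it is not the script that produced this model. The storage estimate assumes each group of `groupsize` weights stores one 16-bit scale and one `wbits`-bit zero point, which is an assumption rather than something this README states.

```python
import argparse

# Flag string copied verbatim from the line above.
FLAGS = "--wbits 3 --groupsize 128 --true-sequential --new-eval --faster-kernel"


def parse_flags(flag_string):
    """Parse the flag string with a hypothetical parser that mirrors the flag names."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--wbits", type=int)
    parser.add_argument("--groupsize", type=int)
    parser.add_argument("--true-sequential", action="store_true")
    parser.add_argument("--new-eval", action="store_true")
    parser.add_argument("--faster-kernel", action="store_true")
    return parser.parse_args(flag_string.split())


def approx_bits_per_weight(wbits, groupsize, scale_bits=16):
    """Estimate bits per weight, including per-group metadata.

    Assumption: every group carries one scale (scale_bits wide) and one
    zero point (wbits wide), amortized over groupsize weights.
    """
    return wbits + (scale_bits + wbits) / groupsize


args = parse_flags(FLAGS)
print(vars(args))
print(approx_bits_per_weight(args.wbits, args.groupsize))
```

Under those assumptions, the per-group metadata adds a fraction of a bit on top of the nominal 3 bits per weight. A smaller group size would raise that overhead in exchange for finer-grained scaling.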