PrunaAI/mattshumer-Llama-3-8B-16K-AWQ-4bit-smashed

Tags: Text Generation, text-generation-inference, Inference Endpoints, 4-bit precision

1 contributor

History: 3 commits

Latest commit (sharpenb): Upload folder using huggingface_hub (#2), cac8f7e, verified, 5 months ago

File                              Size           Last commit                               Updated
.gitattributes                    1.52 kB        initial commit                            7 months ago
README.md                         5.35 kB        Upload folder using huggingface_hub (#2)  5 months ago
config.json                       907 Bytes      Upload folder using huggingface_hub (#2)  5 months ago
generation_config.json            142 Bytes      Upload folder using huggingface_hub (#1)  7 months ago
model-00001-of-00002.safetensors  4.68 GB (LFS)  Upload folder using huggingface_hub (#1)  7 months ago
model-00002-of-00002.safetensors  1.05 GB (LFS)  Upload folder using huggingface_hub (#1)  7 months ago
model.safetensors.index.json      63.5 kB        Upload folder using huggingface_hub (#1)  7 months ago
plots.png                         452 kB         Upload folder using huggingface_hub (#1)  7 months ago
results.json                      1.49 kB        Upload folder using huggingface_hub (#1)  7 months ago
smash_config.json                 1.02 kB        Upload folder using huggingface_hub (#2)  5 months ago
special_tokens_map.json           449 Bytes      Upload folder using huggingface_hub (#2)  5 months ago
tokenizer.json                    9.08 MB        Upload folder using huggingface_hub (#2)  5 months ago
tokenizer_config.json             50.6 kB        Upload folder using huggingface_hub (#2)  5 months ago