Edit Models filters

Inference status

Misc

8-bit precision

Misc with no match

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

4

Full-text search

Active filters: llmcompressor

neuralmagic/Llama-3.2-1B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16 • 2.07k • 3

neuralmagic/Llama-3.2-3B-Instruct-FP8

Text Generation • Updated Oct 16 • 11k • 2

neuralmagic/Llama-3.2-3B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16 • 1.68k • 1

neuralmagic/Llama-3.2-1B-Instruct-FP8

Text Generation • Updated Oct 16 • 259k • 1