Models: 5,683

Active filters: pruna-ai

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-HQQ-1bit-smashed

Text Generation • Updated Jul 15 • 6

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-HQQ-4bit-smashed

Text Generation • Updated Jul 15 • 7

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-HQQ-2bit-smashed

Text Generation • Updated Jul 15 • 7

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-QUANTO-int8bit-smashed

Updated Jul 19 • 6

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-QUANTO-int2bit-smashed

Updated Jul 19 • 5

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-QUANTO-float8bit-smashed

Updated Jul 19 • 5

PrunaAI/BEE-spoke-data-smol_llama-220M-GQA-AWQ-4bit-smashed

Text Generation • Updated Jul 15 • 5

PrunaAI/google-codegemma-1.1-2b-QUANTO-int2bit-smashed

Updated Jul 19 • 5

PrunaAI/google-codegemma-1.1-2b-HQQ-1bit-smashed

Text Generation • Updated Jul 15 • 13

PrunaAI/google-codegemma-1.1-2b-HQQ-2bit-smashed

Text Generation • Updated Jul 15 • 10

PrunaAI/google-codegemma-1.1-2b-HQQ-4bit-smashed

Text Generation • Updated Jul 15 • 12

PrunaAI/google-codegemma-1.1-2b-QUANTO-float8bit-smashed

Updated Jul 19 • 6

PrunaAI/google-codegemma-1.1-2b-QUANTO-int8bit-smashed

Updated Jul 19 • 3

PrunaAI/google-codegemma-1.1-2b-QUANTO-int4bit-smashed

Updated Jul 19 • 3

PrunaAI/google-codegemma-7b-QUANTO-int2bit-smashed

Updated Jul 19 • 5

PrunaAI/google-codegemma-7b-bnb-8bit-smashed

Text Generation • Updated Jul 15 • 10

PrunaAI/google-codegemma-7b-QUANTO-int4bit-smashed

Updated Jul 19 • 3

PrunaAI/google-codegemma-7b-QUANTO-float8bit-smashed

Updated Jul 19 • 3

PrunaAI/google-codegemma-7b-QUANTO-int8bit-smashed

Updated Jul 19 • 4

PrunaAI/mlabonne-NeuralPipe-7B-slerp-bnb-4bit-smashed

Text Generation • Updated Jul 15 • 6

PrunaAI/mlabonne-NeuralPipe-7B-slerp-bnb-8bit-smashed

Text Generation • Updated Jul 15 • 8

PrunaAI/mlabonne-NeuralPipe-7B-slerp-QUANTO-int2bit-smashed

Updated Jul 19 • 4

PrunaAI/mlabonne-NeuralPipe-7B-slerp-HQQ-4bit-smashed

Text Generation • Updated Jul 15 • 8

PrunaAI/mlabonne-NeuralPipe-7B-slerp-HQQ-2bit-smashed

Text Generation • Updated Jul 15 • 8

PrunaAI/mlabonne-NeuralPipe-7B-slerp-HQQ-1bit-smashed

Text Generation • Updated Jul 15 • 9

PrunaAI/mlabonne-NeuralPipe-7B-slerp-QUANTO-int4bit-smashed

Updated Jul 19 • 3

PrunaAI/mlabonne-NeuralPipe-7B-slerp-QUANTO-int8bit-smashed

Updated Jul 19 • 4

PrunaAI/mlabonne-NeuralPipe-7B-slerp-QUANTO-float8bit-smashed

Updated Jul 19 • 3

PrunaAI/mlabonne-NeuralPipe-7B-slerp-AWQ-4bit-smashed

Text Generation • Updated Jul 15 • 6

PrunaAI/h2oai-h2o-danube2-1.8b-chat-bnb-4bit-smashed

Text Generation • Updated Jul 15 • 6