Spaces:

ggml-org
/

gguf-my-repo

Running on A10G

App Files Files Community

Resources

View closed (116)

Add F16 and BF16 quantization

#129 opened 22 days ago by

update readme for card generation

#128 opened about 1 month ago by

[bug] asymmetric t5 models fail to quantize

#126 opened 2 months ago by

[Bug] Extra files with related name were uploaded to the resulting repository

#125 opened 2 months ago by

Issue converting PEFT LoRA fine tuned model to GGUF

#124 opened 2 months ago by

Issue converting nvidia/NV-Embed-v2 to GGUF

#123 opened 2 months ago by

Issue converting FLUX.1-dev model to GGUF format

#122 opened 2 months ago by

Add Llama 3.1 license

#121 opened 2 months ago by

Add an option to put all quantization variants in the same repo

#120 opened 2 months ago by

Phi-3.5-MoE-instruct

#117 opened 3 months ago by

Fails to quntize T5 (xl and xxl) models

#116 opened 3 months ago by

Arm optimized quants

#113 opened 3 months ago by

SaisExperiments

DeepseekForCausalLM is not supported

#112 opened 3 months ago by

Please, update converting script. Llama.cpp added support for Nemotron and Minitron architectures.

#111 opened 3 months ago by

Enable the created name repo to be without the quantization type

#110 opened 3 months ago by

I think I broke the space quantizing 4bit modle with Q4L

#106 opened 4 months ago by

Authorship Metadata support added to converter script, you may want to add the ability to add metadata overrides

#104 opened 4 months ago by

Please support this method:

#96 opened 5 months ago by

Support Q2 imatrix quants

#95 opened 5 months ago by

Maybe impose a max model size?

#33 opened 8 months ago by