
Rombos-LLM-V2.5-Qwen-32b 4.5 BPW exl2

4.5 BPW quant of https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b

Scores 63.9 on Aider benchmarks!


Rombos-LLM-V2.5-Qwen-32b


Rombos-LLM-V2.5-Qwen-32b is a continuously finetuned version of Qwen2.5-32B. I noticed recently that the Qwen team did not adopt my continuous finetuning method, despite its great benefits and lack of downsides. So I took it upon myself to merge the instruct model with the base model using the TIES merge method.
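
For readers unfamiliar with TIES merging, here is a minimal toy sketch of the general idea (trim small parameter changes, elect a sign per parameter, average only the changes that agree with that sign). It works on plain Python lists instead of real model weights, and the function names `trim` and `ties_merge` and the `density` and `weight` parameters are hypothetical names chosen for illustration. This is not the actual merge script or configuration used to build this model.

```python
from typing import List, Sequence


def trim(delta: Sequence[float], density: float) -> List[float]:
    """Keep the `density` fraction of entries with the largest magnitude; zero the rest."""
    k = int(round(len(delta) * density))
    if k <= 0:
        return [0.0] * len(delta)
    order = sorted(range(len(delta)), key=lambda i: abs(delta[i]), reverse=True)
    keep = set(order[:k])
    return [d if i in keep else 0.0 for i, d in enumerate(delta)]


def ties_merge(
    base: Sequence[float],
    finetuned: Sequence[Sequence[float]],
    density: float = 0.5,  # assumed default, for illustration only
    weight: float = 1.0,   # assumed default, for illustration only
) -> List[float]:
    """Toy TIES merge of several finetuned parameter vectors into a base vector."""
    # Step 1: task vectors (finetuned minus base), trimmed to the largest changes.
    deltas = [trim([f - b for f, b in zip(ft, base)], density) for ft in finetuned]

    merged = []
    for i, b in enumerate(base):
        column = [d[i] for d in deltas]
        # Step 2: elect the sign with the larger total magnitude (sign of the sum).
        total = sum(column)
        sign = 1.0 if total > 0 else -1.0 if total < 0 else 0.0
        # Step 3: average only the changes that agree with the elected sign.
        agreeing = [v for v in column if v * sign > 0]
        step = sum(agreeing) / len(agreeing) if agreeing else 0.0
        merged.append(b + weight * step)
    return merged


if __name__ == "__main__":
    base = [0.0, 1.0, -1.0, 2.0]
    model_a = [0.5, 1.0, -3.0, 2.1]
    model_b = [-0.1, 2.0, -2.0, 2.0]
    print(ties_merge(base, [model_a, model_b], density=0.5))
```

With a single finetuned model and `density=1.0`, nothing is trimmed and no signs conflict, so the sketch simply returns the finetuned values; the trimming and sign election only matter when changes are sparsified or several models disagree.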

This version of the model shows higher performance than the original instruct and base models.

Quants: (Coming soon)

GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-32b-GGUF

EXL2:

Benchmarks: (Coming soon)


Model tree for jth01/Rombos-LLM-V2.5-Qwen-32b-4.5bpw-exl2

Base model: Qwen/Qwen2.5-32B
Quantized: this model