--- pipeline_tag: text-generation --- 5.90 bpw exl2 quant of [c4ai-command-r-plus](https://huggingface.co/CohereForAI/c4ai-command-r-plus) Fits in 96 GB VRAM with 20k context. More quants: https://huggingface.co/turboderp/command-r-plus-103B-exl2