Panchovix/goliath-120b-exl2-4.25bpw-rpcal


Panchovix committed on Jan 6

Commit dd633c5 • 1 Parent(s): 71a86f8

Update README.md

Files changed (1)

README.md +2 -0


@@ -3,6 +3,8 @@ license: llama2
 ---
 EXL2 quant of alpindale/goliath-120b (https://huggingface.co/alpindale/goliath-120b), to be used on exllamav2. 4.25bpw to be able to use CFG comfortably on 72GB VRAM. (20,21,22 for gpu split)
+Update 06/01/2024: Updated with new quant method after some time, thanks for the measurement [here](https://github.com/turboderp/exllamav2/files/13846439/goliath-120b-rpcal-measurement.json)
 Calibration dataset is a cleaned, fixed pippa RP dataset, which does affect the results (in favor) for RP usage.
 You can find the calibration dataset [here](https://huggingface.co/datasets/royallab/PIPPA-cleaned)
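
The figures quoted in the README (4.25bpw, 72GB VRAM, a 20,21,22 gpu split) can be sanity-checked with some rough arithmetic. The sketch below is only an estimate under stated assumptions: the parameter count is taken nominally from the "120b" in the model name, the gpu split values are assumed to be gigabytes per GPU, and layers kept at higher precision, the KV cache, and CFG overhead are all ignored. The helper name `quantized_weight_gb` is hypothetical and not part of exllamav2.

```python
def quantized_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: nominal parameter count inferred from the "120b" model name.
N_PARAMS = 120e9
# From the README: average bits per weight of this quant.
BPW = 4.25
# From the README: gpu split, assumed here to be gigabytes per GPU.
GPU_SPLIT_GB = [20, 21, 22]
# From the README: total VRAM across the GPUs.
TOTAL_VRAM_GB = 72

weights_gb = quantized_weight_gb(N_PARAMS, BPW)
split_total_gb = sum(GPU_SPLIT_GB)
headroom_gb = TOTAL_VRAM_GB - split_total_gb

print(f"estimated weights: {weights_gb:.2f} GB")
print(f"gpu split total:   {split_total_gb} GB")
print(f"VRAM outside the split: {headroom_gb} GB")
```

Under these assumptions the estimated weight size is close to the sum of the split values, and the remaining VRAM is what would be left for cache and CFG. Real usage depends on the actual parameter count and context length.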