
Miquliz-120b-v2.0-FP8-dynamic


This quant was made for infermatic.ai

Dynamic FP8 quant of Miquliz 120B v2.0 made with AutoFP8.
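In AutoFP8's dynamic scheme, per-tensor scales are computed from the runtime maximum rather than calibrated offline. A minimal numpy sketch of the per-tensor scale-and-clamp idea behind dynamic FP8 E4M3 quantization (mantissa rounding and real FP8 storage are omitted; function names are illustrative, not AutoFP8's API):

```python
import numpy as np

E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3


def quantize_dynamic_fp8(tensor: np.ndarray):
    """Derive a per-tensor scale from the observed max ("dynamic"),
    then map values into the representable E4M3 range."""
    scale = np.abs(tensor).max() / E4M3_MAX
    q = np.clip(tensor / scale, -E4M3_MAX, E4M3_MAX)
    return q, scale


def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    # Recover an approximation of the original values
    return q * scale


w = np.array([0.1, -2.5, 3.75, -0.02])
q, s = quantize_dynamic_fp8(w)
w_hat = dequantize(q, s)
```

Real FP8 storage additionally rounds the scaled values to the nearest E4M3 representable number, which is where the (small) accuracy loss comes from.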

Model Details

  • Max Context: 32768 tokens
  • Layers: 140

Prompt template: Mistral

<s>[INST] {prompt} [/INST]
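The template above can be filled in with a one-line helper (the function name is illustrative; for chat-template-aware tooling, the tokenizer's own template should be preferred):

```python
def format_mistral_prompt(prompt: str) -> str:
    """Wrap a user prompt in the Mistral instruct template shown above."""
    return f"<s>[INST] {prompt} [/INST]"


print(format_mistral_prompt("Hello"))
# <s>[INST] Hello [/INST]
```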
  • Format: Safetensors
  • Model size: 120B params
  • Tensor types: FP16, F8_E4M3