nougat-small-onnx-quant_avx512_vnni

This was quantized from pszemraj/nougat-small-onnx using the --avx512_vnni flag. You need to have a processor with avx512_vnni instructions for this to work properly.

Downloads last month
5
Inference API
Inference API (serverless) does not yet support transformers models for this pipeline type.

Collection including pszemraj/nougat-small-onnx-quant_avx512_vnni