gpt-j-6B-tensorrt-int8 / gptj-i8.data

Commit History

added onnx model (fake quant) compatible with trt
554833e

igor commited on