ENOT-AutoDL/gpt2-tensorrt

igor committed on Jun 8, 2023

Commit 84e78ed • 1 Parent(s): 918550e

fixed typo

Files changed (1)

README.md +1 -1

@@ -20,7 +20,7 @@ This repository contains GPT2 onnx models compatible with TensorRT:
 * gpt2-xl.onnx - GPT2-XL onnx for fp32 or fp16 engines
 * gpt2-xl-i8.onnx - GPT2-XL onnx for int8+fp32 engines
-Quantization of models was performed by the [ENOT-AutoDL](https://pypi.org/project/enot-autodl/) framewor.
+Quantization of models was performed by the [ENOT-AutoDL](https://pypi.org/project/enot-autodl/) framework.
 Code for building of TensorRT engines and examples published on [github](https://github.com/ENOT-AutoDL/ENOT-transformers).
 ## Metrics: