igor
commited on
Commit
•
84e78ed
1
Parent(s):
918550e
fixed typo
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ This repository contains GPT2 onnx models compatible with TensorRT:
|
|
20 |
* gpt2-xl.onnx - GPT2-XL onnx for fp32 or fp16 engines
|
21 |
* gpt2-xl-i8.onnx - GPT2-XL onnx for int8+fp32 engines
|
22 |
|
23 |
-
Quantization of models was performed by the [ENOT-AutoDL](https://pypi.org/project/enot-autodl/)
|
24 |
Code for building of TensorRT engines and examples published on [github](https://github.com/ENOT-AutoDL/ENOT-transformers).
|
25 |
|
26 |
## Metrics:
|
|
|
20 |
* gpt2-xl.onnx - GPT2-XL onnx for fp32 or fp16 engines
|
21 |
* gpt2-xl-i8.onnx - GPT2-XL onnx for int8+fp32 engines
|
22 |
|
23 |
+
Quantization of models was performed by the [ENOT-AutoDL](https://pypi.org/project/enot-autodl/) framework.
|
24 |
Code for building of TensorRT engines and examples published on [github](https://github.com/ENOT-AutoDL/ENOT-transformers).
|
25 |
|
26 |
## Metrics:
|