Update README.md
README.md CHANGED
@@ -21,7 +21,7 @@ This model is the ONNX version of [https://huggingface.co/SamLowe/roberta-base-g
 
 `onnx/model.onnx` is the full precision ONNX version
 
-- that has identical
+- that has identical accuracy/metrics to the original transformers model
 - and has the same model size (499MB)
 - is faster for inference than normal Transformers, particularly for smaller batch sizes
 - in my tests about 2x to 3x as fast for a batch size of 1 on an 8 core 11th gen i7 CPU using ONNXRuntime
@@ -32,7 +32,7 @@ This model is the ONNX version of [https://huggingface.co/SamLowe/roberta-base-g
 
 - that is one quarter the size (125MB) of the full precision model (above)
 - but delivers almost all of the accuracy
-- is faster
+- is faster for inference
 - about 2x as fast for a batch size of 1 on an 8 core 11th gen i7 CPU using ONNXRuntime vs the full precision model above
 - which makes it circa 5x as fast as the full precision normal Transformers model (on the above mentioned CPU, for a batch of 1)
 
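Since the diff above describes running both the full precision and quantized ONNX exports through ONNX Runtime for faster inference, here is a minimal sketch of how one might load and time them with Optimum + ONNX Runtime. The repo id, the quantized file name (`model_quantized.onnx`), the `subfolder`/`file_name` layout, and the example sentence are assumptions not stated in the diff; actual timings will depend on hardware.

```python
# Minimal sketch (assumptions noted below): compare batch-of-1 latency of the
# full precision and quantized ONNX exports using Optimum + ONNX Runtime.
import time

from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer, pipeline

repo_id = "SamLowe/roberta-base-go_emotions-onnx"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(repo_id)

models = {
    # full precision export (onnx/model.onnx, ~499MB)
    "fp32": ORTModelForSequenceClassification.from_pretrained(
        repo_id, subfolder="onnx", file_name="model.onnx"
    ),
    # quantized export (~125MB) -- file name is an assumption
    "int8": ORTModelForSequenceClassification.from_pretrained(
        repo_id, subfolder="onnx", file_name="model_quantized.onnx"
    ),
}

sentences = ["I am not having a great day"]  # batch size of 1

for name, model in models.items():
    clf = pipeline(
        "text-classification", model=model, tokenizer=tokenizer, top_k=None
    )
    clf(sentences)  # warm-up run so session setup is not timed
    start = time.perf_counter()
    outputs = clf(sentences)
    elapsed = time.perf_counter() - start
    top = outputs[0][0]  # scores come back sorted, highest first
    print(f"{name}: {elapsed * 1000:.1f} ms/batch, "
          f"top label: {top['label']} ({top['score']:.3f})")
```

Warming up each pipeline before timing keeps one-off session initialization out of the measurement, so the printed numbers roughly reflect the per-batch speedups described in the README (quantized vs full precision ONNX on CPU).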