zeroshot commited on
Commit
d5ec521
1 Parent(s): 9dc6274

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -1621,6 +1621,10 @@ language:
1621
 
1622
  # bge-small-en-v1.5-quant
1623
 
 
 
 
 
1624
  ## Usage
1625
 
1626
  This is the quantized (INT8) ONNX variant of the [bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) embeddings model accelerated with [Sparsify](https://github.com/neuralmagic/sparsify) for quantization and [DeepSparseSentenceTransformers](https://github.com/neuralmagic/deepsparse/tree/main/src/deepsparse/sentence_transformers) for inference.
 
1621
 
1622
  # bge-small-en-v1.5-quant
1623
 
1624
+ <div>
1625
+ <img src="https://huggingface.co/zeroshot/bge-small-en-v1.5-quant/resolve/main/latency.png" alt="latency" width="600" style="display:inline-block; margin-right:10px;"/>
1626
+ </div>
1627
+
1628
  ## Usage
1629
 
1630
  This is the quantized (INT8) ONNX variant of the [bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) embeddings model accelerated with [Sparsify](https://github.com/neuralmagic/sparsify) for quantization and [DeepSparseSentenceTransformers](https://github.com/neuralmagic/deepsparse/tree/main/src/deepsparse/sentence_transformers) for inference.