Any guidance on how to use this with sentence-transformers without downloading a bunch of extra stuff?
#3 by simonw - opened
This is a really cool model!
I'm using this with SentenceTransformers and it downloaded 314 MB of files. The big ones were:
- model.safetensors (43 MB)
- pytorch_model.bin (43 MB)
- onnx/model.onnx (86 MB)
- onnx/model_optimized.onnx (86 MB)
- onnx/model_quantized.onnx (22 MB)
Is there a way to use this with SentenceTransformers that only downloads the model file that I need?
Hi @simonw,
You can use huggingface_hub to snapshot only the files you need, then pass the resulting folder to the SentenceTransformer constructor:
from sentence_transformers import SentenceTransformer
from huggingface_hub import snapshot_download

sentences = ["This is an example sentence", "Each sentence is converted"]

model_path = snapshot_download(
    repo_id="TaylorAI/gte-tiny", allow_patterns=["*.json", "pytorch_model.bin"]
)
model = SentenceTransformer(model_path)
embeddings = model.encode(sentences)
print(embeddings)
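For reference, allow_patterns takes shell-style glob patterns. A minimal offline sketch with Python's fnmatch (assuming the same matching semantics that huggingface_hub applies when filtering repo files) shows which of the files listed above would actually be downloaded:

```python
from fnmatch import fnmatch

# Files reported in the thread, plus typical JSON config files a
# sentence-transformers repo contains (names here are illustrative).
repo_files = [
    "config.json",
    "1_Pooling/config.json",
    "model.safetensors",
    "pytorch_model.bin",
    "onnx/model.onnx",
    "onnx/model_optimized.onnx",
    "onnx/model_quantized.onnx",
]

allow_patterns = ["*.json", "pytorch_model.bin"]

# Keep a file if it matches any allow pattern; everything else is skipped.
kept = [f for f in repo_files if any(fnmatch(f, p) for p in allow_patterns)]
print(kept)  # the ONNX weights and model.safetensors are filtered out
```

With these patterns, only the configs and the single weight file are fetched, so the ~194 MB of ONNX files never hit the disk.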
I'd also recommend using model.safetensors instead:
from sentence_transformers import SentenceTransformer
from huggingface_hub import snapshot_download

sentences = ["This is an example sentence", "Each sentence is converted"]

model_path = snapshot_download(
    repo_id="TaylorAI/gte-tiny", allow_patterns=["*.json", "model.safetensors"]
)
model = SentenceTransformer(model_path)
embeddings = model.encode(sentences)
print(embeddings)
I can also release another minimal version without the ONNX weights for SentenceTransformers.
txtai has built-in logic for mean/cls pooling using Transformers, which only downloads the files it needs.
For example:
import txtai
embeddings = txtai.Embeddings(path="TaylorAI/gte-tiny")
embeddings.batchtransform(["text1", "text2"])