Update README.md
Browse files
README.md
CHANGED
@@ -88,6 +88,12 @@ For a demo of multilingual image search, have a look at: [Image_Search-multiling
|
|
88 |
For more details on image search and zero-shot image classification, have a look at the documentation on [SBERT.net](https://www.sbert.net/examples/applications/image-search/README.html).
|
89 |
|
90 |
|
|
|
|
|
|
|
|
|
|
|
|
|
91 |
|
92 |
|
93 |
## Full Model Architecture
|
|
|
88 |
For more details on image search and zero-shot image classification, have a look at the documentation on [SBERT.net](https://www.sbert.net/examples/applications/image-search/README.html).
|
89 |
|
90 |
|
91 |
+
## Training
|
92 |
+
This model has been created using [Multilingual Knowledge Distillation](https://arxiv.org/abs/2004.09813). As teacher model, we used the original `clip-ViT-B-32` and then trained a [multilingual DistilBERT](https://huggingface.co/distilbert-base-multilingual-cased) model as student model. Using parallel data, the multilingual student model learns to align the teachers vector space across many languages. As a result, you get an text embedding model that works for 50+ languages.
|
93 |
+
|
94 |
+
The image encoder from CLIP is unchanged, i.e. you can use the original CLIP image encoder to encode images.
|
95 |
+
|
96 |
+
Have a look at the [SBERT.net - Multilingual-Models documentation](https://www.sbert.net/examples/training/multilingual/README.html) on more details and for **training code**.
|
97 |
|
98 |
|
99 |
## Full Model Architecture
|