ChrisBridges committed
Commit 62951ca
1 Parent(s): 53a3396
Update README.md

README.md CHANGED
@@ -18,11 +18,12 @@ tags:
 
 # LaBSE-Malach-Multilabel
 
-A multilabel text classification model fine-tuned on
+A multilabel text classification model fine-tuned on a small English subset (Malach ASR) of the Visual History Archive.
 Based on LaBSE pretrained weights but it uses the general Hugging Face framework, not sentence-transformers.
 Input text segments consisted of ~350 words on average.
 
-Given an input string, the model predicts probablites for 1063 keyword IDs from the VHA ontology
+Given an input string, the model predicts probablites for 1063 keyword IDs from the VHA ontology.
 Typically, probabilities >= 0.5 are "True" if encoding them in a binary vector.
+Due to the small training data, the most likely predictions are usually correct but do not meet the threshold.
 
 The mapping from keyword IDs to labels will be added to the repository.
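
The updated README describes two ways to read the model output: a binary vector obtained by thresholding at 0.5, and the most likely predictions, which may fall below that threshold. The sketch below illustrates both on a toy output vector. It is not the author's code: the function name `decode_multilabel`, the `top_k` parameter, and the assumption that the model emits raw logits that are turned into independent per-label probabilities with a sigmoid are all illustrative assumptions. The returned indices are positions in the output vector, not VHA keyword IDs, since the ID-to-label mapping is not yet in the repository.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_multilabel(logits, threshold=0.5, top_k=3):
    """Return (binary_vector, top_k_indices) for one input segment.

    Assumption: `logits` holds one raw score per label (1063 in the real
    model; 5 in the toy example below) and labels are scored independently.
    """
    probs = [sigmoid(z) for z in logits]
    # Binary encoding as described in the README: True where p >= threshold.
    binary = [p >= threshold for p in probs]
    # Ranking view: the most likely labels, whether or not they pass the threshold.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return binary, ranked[:top_k]


if __name__ == "__main__":
    # Toy scores: only position 2 clears 0.5, but positions 3 and 0 are close.
    toy_logits = [-0.2, -3.0, 0.4, -0.1, -5.0]
    binary, top = decode_multilabel(toy_logits, threshold=0.5, top_k=3)
    print(binary)  # [False, False, True, False, False]
    print(top)     # [2, 3, 0]
```

Reading the top-k ranking alongside the thresholded vector is one way to surface near-miss labels that the README says are usually correct; lowering the threshold would be another, at the cost of more false positives.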