Multilingual Translate-Distill
Collection
Collection of trained model and teacher scores for distillation for paper "Multilingual Translate-Distill" Code: https://github.com/hltcoe/ColBERT-X
•
9 items
•
Updated
Multilingual Translate-Distill is a training technique that produces state-of-the-art MLIR dense retrieval model through translation and distillation.
plaidx-large-clef-mtd-mix-entries-mt5xxl-engeng
is trained with KL-Divergence from the mt5xxl
MonoT5 reranker
unicamp-dl/mt5-13b-mmarco-100k
inferenced on English MS MARCO training queries and passages.
The teacher scores can be found in
hltcoe/tdist-msmarco-scores
.
round-robin-entires
, see below)mix-passages
: languages are randomly assigned to the 6 sampled passages for a given query during training.mix-entries
: all passages in the a given query-passage set are randomly assigned to the same language.round-robin-entires
: for each query, the query-passage set is repeated n
times to iterate through all languages.To properly load ColBERT-X models from Huggingface Hub, please use the following version of PLAID-X.
pip install PLAID-X>=0.3.1
Following code snippet loads the model through Huggingface API.
from colbert.modeling.checkpoint import Checkpoint
from colbert.infra import ColBERTConfig
Checkpoint('hltcoe/plaidx-large-clef-mtd-mix-entries-mt5xxl-engeng', colbert_config=ColBERTConfig())
For full tutorial, please refer to the PLAID-X Jupyter Notebook, which is part of the SIGIR 2023 CLIR Tutorial.
Please cite the following two papers if you use the model.
@inproceedings{mtt,
title = {Neural Approaches to Multilingual Information Retrieval},
author = {Dawn Lawrie and Eugene Yang and Douglas W Oard and James Mayfield},
booktitle = {Proceedings of the 45th European Conference on Information Retrieval (ECIR)},
year = {2023},
doi = {10.1007/978-3-031-28244-7_33},
url = {https://arxiv.org/abs/2209.01335}
}
@inproceedings{mtd,
author = {Eugene Yang and Dawn Lawrie and James Mayfield},
title = {Distillation for Multilingual Information Retrieval},
booktitle = {Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (Short Paper) (Accepted)},
year = {2024}
url = {https://arxiv.org/abs/2405.00977}
}