ColPali
Safetensors
English
vidore_no_match
manu committed
Commit fd4e1bb
1 Parent(s): 5612e75

Update README.md

Files changed (1)
  1. README.md +5 -3
README.md CHANGED
@@ -1,7 +1,7 @@
 ---
-base_model: vidore/colpaligemma-3b-mix-448-base
 license: mit
 library_name: colpali
+base_model: vidore/colpaligemma-3b-mix-448-base
 language:
 - en
 tags:
@@ -14,6 +14,8 @@ It is a [PaliGemma-3B](https://huggingface.co/google/paligemma-3b-mix-448) exten
 It was introduced in the paper [ColPali: Efficient Document Retrieval with Vision Language Models](https://arxiv.org/abs/2407.01449) and first released in [this repository](https://github.com/ManuelFay/colpali)
 
 This version has right padding to fix unwanted tokens in the query encoding + hard negative mining.
+It also stems from the fixed `vidore/colpaligemma-3b-mix-448-base` to guarantee deterministic projection layer initialization.
+
 
 ## Model Description
 
@@ -59,8 +61,8 @@ def main() -> None:
     """Example script to run inference with ColPali"""
 
     # Load model
-    model_name = "vidore/colpali"
-    model = ColPali.from_pretrained("google/paligemma-3b-mix-448", torch_dtype=torch.bfloat16, device_map="cuda").eval()
+    model_name = "manu/colpali-hard-v1.1"
+    model = ColPali.from_pretrained("vidore/colpaligemma-3b-mix-448-base", torch_dtype=torch.bfloat16, device_map="cuda").eval()
     model.load_adapter(model_name)
     processor = AutoProcessor.from_pretrained(model_name)
 
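
For context, the updated loading snippet from this commit can be run as a self-contained script roughly as follows. This is a minimal sketch: the `colpali_engine` import path is assumed from early releases of the library (later releases expose `ColPali` directly from `colpali_engine.models`), and everything outside the four lines shown in the diff is illustrative.

```python
import torch
from transformers import AutoProcessor

# Import path assumed from early colpali-engine releases; newer releases
# expose `from colpali_engine.models import ColPali` instead.
from colpali_engine.models.paligemma_colbert_architecture import ColPali


def main() -> None:
    """Example script to run inference with ColPali"""

    # Load model: the trained adapter now sits on top of the fixed base
    # checkpoint `vidore/colpaligemma-3b-mix-448-base` rather than the raw
    # PaliGemma weights.
    model_name = "manu/colpali-hard-v1.1"
    model = ColPali.from_pretrained(
        "vidore/colpaligemma-3b-mix-448-base",
        torch_dtype=torch.bfloat16,
        device_map="cuda",
    ).eval()
    model.load_adapter(model_name)

    # The processor (tokenizer + image processor) comes from the adapter repo.
    processor = AutoProcessor.from_pretrained(model_name)
    print(type(model), type(processor))


if __name__ == "__main__":
    main()
```

Loading the base weights first and attaching the adapter separately matches the commit's stated goal: the projection layer comes from the fixed base checkpoint, so its initialization is deterministic across loads.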