philipphager
/

baidu-ultr_uva-bert_naive-listwise

Transformers

Safetensors

bert

Inference Endpoints

Model card Files Files and versions Community

philipphager commited on Apr 30

Commit

e707f58

•

1 Parent(s): 44584ab

Update README.md

Browse files

Files changed (1) hide show

README.md +10 -10

README.md CHANGED Viewed

@@ -16,17 +16,17 @@ metrics:
 # Naive Listwise MonoBERT trained on Baidu-ULTR
 A flax-based MonoBERT cross encoder trained on the [Baidu-ULTR](https://arxiv.org/abs/2207.03051) dataset with a **listwise softmax cross-entropy loss on clicks**. The loss is called "naive" as we use user clicks as a signal of relevance without any additional position bias correction. For more info, [read our paper](https://arxiv.org/abs/2404.02543) and [find the code for this model here](https://github.com/philipphager/baidu-bert-model).
-## Test Results on Baidu-ULTR Expert Annotations
-| Model               | Log-likelihood | DCG@1 | DCG@3 | DCG@5 | DCG@10 | nDCG@10 | MRR@10 |
-|---------------------|----------------|-------|-------|-------|--------|---------|--------|
-| Pointwise Naive     | 0.227          | 1.641 | 3.462 | 4.752 | 7.251  | 0.357   | 0.609  |
-| Pointwise Two-Tower | 0.218          | 1.629 | 3.471 | 4.822 | 7.456  | 0.367   | 0.607  |
-| Pointwise IPS       | 0.222          | 1.295 | 2.811 | 3.977 | 6.296  | 0.307   | 0.534  |
-| Listwise Naive      | -              | 1.947 | 4.108 | 5.614 | 8.478  | 0.405   | 0.639  |
-| Listwise IPS        | -              | 1.671 | 3.530 | 4.873 | 7.450  | 0.361   | 0.603  |
-| Listwise DLA        | -              | 1.796 | 3.730 | 5.125 | 7.802  | 0.377   | 0.615  |
 ## Usage

 # Naive Listwise MonoBERT trained on Baidu-ULTR
 A flax-based MonoBERT cross encoder trained on the [Baidu-ULTR](https://arxiv.org/abs/2207.03051) dataset with a **listwise softmax cross-entropy loss on clicks**. The loss is called "naive" as we use user clicks as a signal of relevance without any additional position bias correction. For more info, [read our paper](https://arxiv.org/abs/2404.02543) and [find the code for this model here](https://github.com/philipphager/baidu-bert-model).
+## Test Results on Baidu-ULTR
+Ranking performance is measured in DCG, nDCG, and MRR on expert annotations (6,985 queries). Click prediction performance is measured in log-likelihood on one test partition of user clicks (49,495 queries).
+| Model                                                                                          | Log-likelihood | DCG@1 | DCG@3 | DCG@5 | DCG@10 | nDCG@10 | MRR@10 |
+|------------------------------------------------------------------------------------------------|----------------|-------|-------|-------|--------|---------|--------|
+| [Pointwise Naive](https://huggingface.co/philipphager/baidu-ultr_uva-bert_naive-pointwise)     | 0.227          | 1.641 | 3.462 | 4.752 | 7.251  | 0.357   | 0.609  |
+| [Pointwise Two-Tower](https://huggingface.co/philipphager/baidu-ultr_uva-bert_twotower)        | 0.218          | 1.629 | 3.471 | 4.822 | 7.456  | 0.367   | 0.607  |
+| [Pointwise IPS](https://huggingface.co/philipphager/baidu-ultr_uva-bert_ips-pointwise)         | 0.222          | 1.295 | 2.811 | 3.977 | 6.296  | 0.307   | 0.534  |
+| [Listwise Naive](https://huggingface.co/philipphager/baidu-ultr_uva-bert_naive-listwise)       | -              | 1.947 | 4.108 | 5.614 | 8.478  | 0.405   | 0.639  |
+| [Listwise IPS](https://huggingface.co/philipphager/baidu-ultr_uva-bert_ips-listwise)           | -              | 1.671 | 3.530 | 4.873 | 7.450  | 0.361   | 0.603  |
+| [Listwise DLA](https://huggingface.co/philipphager/baidu-ultr_uva-bert_dla)                    | -              | 1.796 | 3.730 | 5.125 | 7.802  | 0.377   | 0.615  |
 ## Usage