zli12321/answer_equivalence_distilroberta

@@ -40,7 +40,7 @@ print("Exact Match: ", match_result)
 ```
 #### Transformer Match
-Our fine-tuned BERT model is in this repository. Our package also supports downloading and matching directly. More matching transformer models will be available 🔥🔥🔥
 ```python
 from qa_metrics.transformerMatcher import TransformerMatcher
@@ -49,7 +49,7 @@ question = "who will take the throne after the queen dies"
 tm = TransformerMatcher("distilroberta")
 scores = tm.get_scores(reference_answer, candidate_answer, question)
 match_result = tm.transformer_match(reference_answer, candidate_answer, question)
-print("Score: %s; distilroberta Match: %s" % (scores, match_result))
 ```
 #### F1 Score
@@ -71,10 +71,10 @@ question = "who will take the throne after the queen dies"
 cfm = CFMatcher()
 scores = cfm.get_scores(reference_answer, candidate_answer, question)
 match_result = cfm.cf_match(reference_answer, candidate_answer, question)
-print("Score: %s; CF Match: %s" % (scores, match_result))
 ```
-If you find this repo useful, please cite:
 ```bibtex
 @misc{li2024cfmatch,
   title={CFMatch: Aligning Automated Answer Equivalence Evaluation with Expert Judgments For Open-Domain Question Answering},
@@ -86,10 +86,11 @@ If you find this repo useful, please cite:
 }
 ```
 ## Updates
 - [01/24/24] 🔥 The full paper is uploaded and can be accessed [here](https://arxiv.org/abs/2401.13170). The dataset is expanded and the leaderboard is updated.
 - Our Training Dataset is adapted and augmented from [Bulian et al](https://github.com/google-research-datasets/answer-equivalence-dataset). Our [dataset repo](https://github.com/zli12321/Answer_Equivalence_Dataset.git) includes the augmented training set and QA evaluation testing sets discussed in our paper.
-- Now our model supports Distilroberta, a smaller and more robust matching model than Bert!
 ## License

 ```
 #### Transformer Match
+Our fine-tuned BERT model is in this repository. Our package also supports downloading the model and matching directly. distilroberta, distilbert, and roberta are now supported as well! 🔥🔥🔥
 ```python
 from qa_metrics.transformerMatcher import TransformerMatcher
 tm = TransformerMatcher("distilroberta")
 scores = tm.get_scores(reference_answer, candidate_answer, question)
 match_result = tm.transformer_match(reference_answer, candidate_answer, question)
+print("Score: %s; CF Match: %s" % (scores, match_result))
 ```
 #### F1 Score
 cfm = CFMatcher()
 scores = cfm.get_scores(reference_answer, candidate_answer, question)
 match_result = cfm.cf_match(reference_answer, candidate_answer, question)
+print("Score: %s; bert Match: %s" % (scores, match_result))
 ```
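The body of the F1 Score section is elided from this diff. For orientation only, the sketch below shows how token-level F1 and exact match between a reference and a candidate answer are commonly computed (SQuAD-style normalization). It is a generic illustration, not the `qa_metrics` implementation; the function names `normalize`, `token_f1`, and `exact_match` are hypothetical and do not come from the package.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    """Lowercase, drop punctuation and English articles, split on whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return text.split()


def token_f1(reference: str, candidate: str) -> float:
    """Harmonic mean of token precision and recall (hypothetical helper)."""
    ref_tokens = normalize(reference)
    cand_tokens = normalize(candidate)
    if not ref_tokens or not cand_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(ref_tokens == cand_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(ref_tokens) & Counter(cand_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def exact_match(reference: str, candidate: str) -> bool:
    """True when the normalized token sequences are identical."""
    return normalize(reference) == normalize(candidate)
```

Token overlap rewards partial answers but cannot tell whether two differently worded answers mean the same thing, which is the gap the transformer and CF matchers in this diff are meant to address.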
+If you find this repo useful, please cite our paper:
 ```bibtex
 @misc{li2024cfmatch,
   title={CFMatch: Aligning Automated Answer Equivalence Evaluation with Expert Judgments For Open-Domain Question Answering},
 }
 ```
 ## Updates
 - [01/24/24] 🔥 The full paper is uploaded and can be accessed [here](https://arxiv.org/abs/2401.13170). The dataset is expanded and the leaderboard is updated.
 - Our Training Dataset is adapted and augmented from [Bulian et al](https://github.com/google-research-datasets/answer-equivalence-dataset). Our [dataset repo](https://github.com/zli12321/Answer_Equivalence_Dataset.git) includes the augmented training set and QA evaluation testing sets discussed in our paper.
+- We now support [distilroberta](https://huggingface.co/Zongxia/answer_equivalence_distilroberta) and [distilbert](https://huggingface.co/Zongxia/answer_equivalence_distilbert), smaller and more robust matching models than BERT!
 ## License