Update README.md

README.md CHANGED

@@ -9,6 +9,10 @@ license: gpl-3.0
 inference: false
 ---
 
+**Deprecation Notice**
+This model is deprecated. New Filipino Transformer models trained on much larger corpora are available.
+Use [`jcblaise/roberta-tagalog-base`](https://huggingface.co/jcblaise/roberta-tagalog-base) or [`jcblaise/roberta-tagalog-large`](https://huggingface.co/jcblaise/roberta-tagalog-large) instead for better performance.
+
 # DistilBERT Tagalog Base Cased
 Tagalog version of DistilBERT, distilled from [`bert-tagalog-base-cased`](https://huggingface.co/jcblaise/bert-tagalog-base-cased). This model is part of a larger research project. We open-source the model to allow greater usage within the Filipino NLP community.
 
@@ -32,15 +36,6 @@ Finetuning scripts and other utilities we use for our projects can be found in o
 All model details and training setups can be found in our papers. If you use our model or find it useful in your projects, please cite our work:
 
 ```
-@inproceedings{localization2020cruz,
-  title={{Localization of Fake News Detection via Multitask Transfer Learning}},
-  author={Cruz, Jan Christian Blaise and Tan, Julianne Agatha and Cheng, Charibeth},
-  booktitle={Proceedings of The 12th Language Resources and Evaluation Conference},
-  pages={2589--2597},
-  year={2020},
-  url={https://www.aclweb.org/anthology/2020.lrec-1.315}
-}
-
 @article{cruz2020establishing,
   title={Establishing Baselines for Text Classification in Low-Resource Languages},
   author={Cruz, Jan Christian Blaise and Cheng, Charibeth},
@@ -60,4 +55,4 @@ All model details and training setups can be found in our papers. If you use our
 Data used to train this model as well as other benchmark datasets in Filipino can be found on my website at https://blaisecruz.com
 
 ## Contact
-If you have questions, concerns, or if you just want to chat about NLP and low-resource languages in general, you may reach me through my work email at
+If you have questions, concerns, or if you just want to chat about NLP and low-resource languages in general, you may reach me through my work email at me@blaisecruz.com
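For reference, a minimal sketch of how the replacement checkpoints named in the deprecation notice might be loaded with the Hugging Face `transformers` library. The fill-mask task and the `<mask>` token are assumptions based on the RoBERTa architecture rather than usage documented in this card, and the example sentence is illustrative only.

```python
# Hypothetical usage sketch for the recommended replacement checkpoints.
# Assumes a RoBERTa-style masked-LM head and <mask> token; consult the
# model card of each checkpoint for authoritative usage instructions.
from transformers import pipeline

fill_mask = pipeline(
    "fill-mask",
    model="jcblaise/roberta-tagalog-base",  # or "jcblaise/roberta-tagalog-large"
)

# Predict the masked Tagalog word and print the top candidates with scores.
for prediction in fill_mask("Magandang araw, kumusta ang iyong <mask>?"):
    print(prediction["token_str"], round(prediction["score"], 4))
```

A checkpoint distilled from `bert-tagalog-base-cased`, like the deprecated model this card describes, would be expected to use the BERT-style `[MASK]` token instead.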