license: cc-by-nc-sa-4.0
pipeline_tag: fill-mask
language: en
tags:
  - biomedical
  - long-documents

Biomedical Longformer (base)

This is a derivative model based on the microsoft/BiomedNLP-PubMedBERT-large-uncased-abstract BERT model, developed in the work "Fine-Tuning Large Neural Language Models for Biomedical Natural Language Processing" by Tinn et al. (2021). All model parameters were cloned from the original model, while the positional embeddings were extended by cloning the original embeddings multiple times, following Beltagy et al. (2020), using a Python script similar to this one (https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb).
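
The embedding extension described above can be illustrated with a minimal sketch. This is not the script used for this model: the function name `extend_position_embeddings`, the toy table sizes, and the plain-list representation are all assumptions made for illustration, and the sketch assumes BERT-style position indices that start at 0 with no reserved offset. In practice the same tiling would be applied to the position embedding weight tensor of the loaded model.

```python
def extend_position_embeddings(original, new_max_positions):
    """Build a longer position-embedding table by repeating the original rows.

    `original` is a list of embedding vectors, one per position.
    Hypothetical helper for illustration only.
    """
    if not original:
        raise ValueError("original embedding table is empty")
    old_max_positions = len(original)
    # Row i of the extended table is a copy of row (i mod old_max_positions),
    # so the original table is cloned end to end until the new length is reached.
    return [list(original[i % old_max_positions]) for i in range(new_max_positions)]


# Toy example: 4 original positions, hidden size 3, extended to 10 positions.
toy = [[float(position)] * 3 for position in range(4)]
extended = extend_position_embeddings(toy, 10)

print(len(extended))          # number of positions after extension
print(extended[4] == toy[0])  # position 4 reuses the embedding of position 0
```

Each extended row is a copy, not a reference to the original row, so the repeated positions can diverge from one another during any further training.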