andreasmadsen
/

efficient_mlm_m0.70

roberta-prelayernorm

Model card Files Files and versions Community

efficient_mlm_m0.70 / README.md

andreasmadsen's picture

Upload with huggingface_hub

36c0318 almost 2 years ago

|

history blame contribute delete

630 Bytes

	---
	inference: false
	---
	This is a model checkpoint for ["Should You Mask 15% in Masked Language Modeling"](https://arxiv.org/abs/2202.08005) [(code)](https://github.com/princeton-nlp/DinkyTrain.git).

	The original checkpoint is avaliable at [princeton-nlp/efficient_mlm_m0.70](https://huggingface.co/princeton-nlp/efficient_mlm_m0.70). Unfortunately this checkpoint depends on code that isn't part of the official `transformers`
	library. Additionally, the checkpoints contains unused weights due to a bug.

	This checkpoint fixes the unused weights issue and uses the `RobertaPreLayerNorm` model from the `transformers`
	library.