<!-- Provide a quick summary of what the model is/does. -->

LiBERTa Large is a BERT-like model pre-trained from scratch exclusively for Ukrainian. It was presented at the [UNLP](https://unlp.org.ua/) workshop @ [LREC-COLING 2024](https://lrec-coling-2024.org/). Further details are in the paper [LiBERTa: Advancing Ukrainian Language Modeling through Pre-training from Scratch](https://aclanthology.org/2024.unlp-1.14/).

All the code is available in the [Goader/ukr-lm](https://github.com/Goader/ukr-lm) repository.

## Evaluation

Read the [paper](https://aclanthology.org/2024.unlp-1.14/) for more detailed task configurations.

| Model | NER-UK (Micro F1) | WikiANN (Micro F1) | UD POS (Accuracy) | News (Macro F1) |
|:------|:-----------------:|:------------------:|:-----------------:|:---------------:|
| [liberta-large](https://huggingface.co/Goader/liberta-large) | 91.27 (1.22) | 92.50 (0.07) | 98.62 (0.08) | 95.44 (0.04) |
| [liberta-large-v2](https://huggingface.co/Goader/liberta-large-v2) | __91.73 (1.81)__ | __93.22 (0.14)__ | __98.79 (0.06)__ | 95.67 (0.12) |
## Fine-Tuning Hyperparameters
| Hyperparameter | Value |
|:---------------|:-----:|
| Peak Learning Rate | 3e-5 |
| Warm-up Ratio | 0.05 |
| Learning Rate Decay | Linear |
| Batch Size | 16 |
| Epochs | 10 |
| Weight Decay | 0.05 |
## How to Get Started with the Model
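
A minimal usage sketch with the `transformers` fill-mask pipeline. This assumes the checkpoint exposes a standard fill-mask head and a `<mask>` token; check the tokenizer's `mask_token` if unsure:

```python
from transformers import pipeline

# Load the fill-mask pipeline with the v2 checkpoint.
fill_mask = pipeline("fill-mask", model="Goader/liberta-large-v2")

# Example sentence in Ukrainian: "Kyiv is the capital of <mask>."
predictions = fill_mask("Київ є столицею <mask>.")
for p in predictions:
    print(p["token_str"], round(p["score"], 4))
```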