catherinearnett
committed on
Commit 871a8be
1 Parent(s): 07ee7d0
Upload README.md with huggingface_hub
README.md
CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 
 # B-GPT_pl_en_sequential
 
-This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on Polish data. In the second half of training, the model was trained on only
+This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on Polish data. In the second half of training, the model was trained on only English data. At the end of training, 50 % of training data seen by the model is Polish and 50 % is English. The tokenizer was trained on the same overall proportions of data as the language model at the final step.
 
 ## Model details:
 
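The sequential schedule described in the added line (Polish only for the first half of training, English only for the second half) can be illustrated with a minimal sketch. This is not the authors' training code: the function names, the step count of 1000, and the assumption that every step consumes the same amount of data are hypothetical and chosen only to show why the final mix comes out to 50 % Polish and 50 % English.

```python
def language_at_step(step: int, total_steps: int) -> str:
    """Return the training language for a 0-indexed step.

    Hypothetical helper: the first half of the steps is Polish only,
    the second half is English only.
    """
    return "pl" if step < total_steps // 2 else "en"


def language_shares(total_steps: int) -> dict[str, float]:
    """Compute the share of steps spent on each language.

    Assumes every step sees the same amount of data, so the share of
    steps equals the share of training data.
    """
    counts = {"pl": 0, "en": 0}
    for step in range(total_steps):
        counts[language_at_step(step, total_steps)] += 1
    return {lang: n / total_steps for lang, n in counts.items()}


# Hypothetical step count; the real value is not stated in this diff.
shares = language_shares(1000)
print(shares)
```

With an even number of steps the two shares are equal, which matches the proportions stated in the README text.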