Commit 12670c5, committed by catherinearnett
Parent(s): fa1a44b
Upload README.md with huggingface_hub
README.md CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 
 # B-GPT_en_pl_sequential
 
-This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on English data. In the second half of training, the model was trained on only Polish data. At the end of training, 50
+This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on English data. In the second half of training, the model was trained on only Polish data. At the end of training, 50% of training data seen by the model is English and 50% is Polish. The tokenizer was trained on the same overall proportions of data as the language model at the final step.
 
 ## Model details:
 