Update README.md
Browse files
README.md
CHANGED
@@ -32,7 +32,7 @@ Training data contains 150,000 Chinese lyrics which are collected by [Chinese-Ly
|
|
32 |
|
33 |
## Training procedure
|
34 |
|
35 |
-
The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tencent Cloud TI-ONE](https://cloud.tencent.com/product/tione/). We pre-train 100,000 steps with a sequence length of 512 on the basis of the model [gpt2-base-chinese-cluecorpussmall](https://huggingface.co/uer/gpt2-base-chinese-cluecorpussmall)
|
36 |
|
37 |
```
|
38 |
python3 preprocess.py --corpus_path corpora/lyric.txt \
|
|
|
32 |
|
33 |
## Training procedure
|
34 |
|
35 |
+
The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tencent Cloud TI-ONE](https://cloud.tencent.com/product/tione/). We pre-train 100,000 steps with a sequence length of 512 on the basis of the pre-trained model [gpt2-base-chinese-cluecorpussmall](https://huggingface.co/uer/gpt2-base-chinese-cluecorpussmall)
|
36 |
|
37 |
```
|
38 |
python3 preprocess.py --corpus_path corpora/lyric.txt \
|