Update README.md
README.md CHANGED
@@ -97,8 +97,8 @@ Pile-T5 can be loaded using the `AutoModelForSeq2SeqLM` functionality:
 ```python
 from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
 
-tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-
-model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-
+tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xl")
+model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xl")
 ```
 
 ### Training
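For readers landing on this diff, the snippet above only loads the tokenizer and model. A minimal inference sketch is shown below, assuming the checkpoint follows the standard Hugging Face seq2seq interface; the prompt text and generation settings are illustrative choices, not part of the README.

```python
# Minimal inference sketch (assumes the loading snippet above has already run).
# Encode an arbitrary prompt and run greedy decoding with a small length cap.
inputs = tokenizer("The Pile is a large and diverse dataset for", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```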
@@ -132,6 +132,8 @@ There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
 checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
 
 
+The training loss (in tfevent format) and validation perplexity (in jsonl) can be found [here](https://huggingface.co/EleutherAI/pile-t5-xl/blob/main/xl.zip).
+
 ### Evaluations
 
 Pile-T5 XL was evaluated on SuperGLUE and CodeXGLUE. A Flan-finetuned version was evaluated on Flan Held In tasks, MMLU and BBH.
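The second hunk's context mentions the 200 intermediate checkpoints spaced 10,000 steps apart. If those checkpoints are published as revisions of the Hugging Face repo, they could be pulled with the standard `revision` argument of `from_pretrained`, as in the sketch below; the branch name used here is a hypothetical placeholder, since the diff does not say how the revisions are named.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Hypothetical revision name: assumes intermediate checkpoints are exposed as
# Hub branches keyed by training step. Check the repo's branch list before use.
revision = "step100000"

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xl", revision=revision)
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xl", revision=revision)
```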
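The line added in the second hunk points to `xl.zip`, which bundles the training loss as TensorBoard event files and the validation perplexity as JSONL. A generic way to read both formats after unzipping is sketched below; the directory layout, scalar tag, and JSONL file name are assumptions, so adjust them to whatever the archive actually contains.

```python
import json

from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

# Training loss from tfevent files; "xl/train" is an assumed unzip location.
acc = EventAccumulator("xl/train")
acc.Reload()
print(acc.Tags()["scalars"])          # list the available scalar tags first

# "loss" is a placeholder tag; substitute one of the tags printed above.
for event in acc.Scalars("loss"):
    print(event.step, event.value)

# Validation perplexity from a jsonl file; the file name is an assumed placeholder.
with open("xl/val_perplexity.jsonl") as f:
    for line in f:
        print(json.loads(line))       # inspect the keys to find step and perplexity
```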