Update README.md
Browse files
README.md
CHANGED
@@ -97,8 +97,8 @@ Pile-T5 can be loaded using the `AutoModelForSeq2SeqLM` functionality:
|
|
97 |
```python
|
98 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
99 |
|
100 |
-
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-
|
101 |
-
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-
|
102 |
```
|
103 |
|
104 |
### Training
|
@@ -131,6 +131,7 @@ Intermediate checkpoints for Pile-T5 are accessible within this repository.
|
|
131 |
There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
|
132 |
checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
|
133 |
|
|
|
134 |
|
135 |
### Evaluations
|
136 |
|
|
|
97 |
```python
|
98 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
99 |
|
100 |
+
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xxl")
|
101 |
+
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xxl")
|
102 |
```
|
103 |
|
104 |
### Training
|
|
|
131 |
There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
|
132 |
checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
|
133 |
|
134 |
+
The training loss (in tfevent format) and validation perplexity (in jsonl) can be found [here](https://huggingface.co/EleutherAI/pile-t5-xxl/blob/main/xxl.zip).
|
135 |
|
136 |
### Evaluations
|
137 |
|