EleutherAI
/

pile-t5-xxl

Text2Text Generation

encoder-decoder

Inference Endpoints

Model card Files Files and versions Community

lintang commited on Apr 16

Commit

485c782

•

1 Parent(s): 1177fe6

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -97,8 +97,8 @@ Pile-T5 can be loaded using the `AutoModelForSeq2SeqLM` functionality:
 ```python
 from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
-tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-base")
-model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-base")
 ```
 ### Training
@@ -131,6 +131,7 @@ Intermediate checkpoints for Pile-T5 are accessible within this repository.
 There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
 checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
 ### Evaluations

 ```python
 from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xxl")
+model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xxl")
 ```
 ### Training
 There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
 checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
+The training loss (in tfevent format) and validation perplexity (in jsonl) can be found [here](https://huggingface.co/EleutherAI/pile-t5-xxl/blob/main/xxl.zip).
 ### Evaluations