patrickvonplaten committed
Commit 686f1db • 1 Parent(s): 086bcba
Update README.md
README.md CHANGED
@@ -22,22 +22,33 @@ Thereby, the following datasets were being used for (1.) and (2.):
 
 1. **Datasets used for Unsupervised denoising objective**:
 
--
+- [C4](https://huggingface.co/datasets/c4)
+- [Wiki-DPR](https://huggingface.co/datasets/wiki_dpr)
+
 
 2. **Datasets used for Supervised text-to-text language modeling objective**
 
 - Sentence acceptability judgment
--
-- Sentiment analysis
--
-
--
-
--
--
--
--
-
+  - CoLA [Warstadt et al., 2018](https://arxiv.org/abs/1805.12471)
+- Sentiment analysis
+  - SST-2 [Socher et al., 2013](https://nlp.stanford.edu/~socherr/EMNLP2013_RNTN.pdf)
+- Paraphrasing/sentence similarity
+  - MRPC [Dolan and Brockett, 2005](https://aclanthology.org/I05-5002)
+  - STS-B [Cer et al., 2017](https://arxiv.org/abs/1708.00055)
+  - QQP [Iyer et al., 2017](https://quoradata.quora.com/First-Quora-Dataset-Release-Question-Pairs)
+- Natural language inference
+  - MNLI [Williams et al., 2017](https://arxiv.org/abs/1704.05426)
+  - QNLI [Rajpurkar et al., 2016](https://arxiv.org/abs/1606.05250)
+  - RTE [Dagan et al., 2005](https://link.springer.com/chapter/10.1007/11736790_9)
+  - CB [De Marneffe et al., 2019](https://semanticsarchive.net/Archive/Tg3ZGI2M/Marneffe.pdf)
+- Sentence completion
+  - COPA [Roemmele et al., 2011](https://www.researchgate.net/publication/221251392_Choice_of_Plausible_Alternatives_An_Evaluation_of_Commonsense_Causal_Reasoning)
+- Word sense disambiguation
+  - WiC [Pilehvar and Camacho-Collados, 2018](https://arxiv.org/abs/1808.09121)
+- Question answering
+  - MultiRC [Khashabi et al., 2018](https://aclanthology.org/N18-1023)
+  - ReCoRD [Zhang et al., 2018](https://arxiv.org/abs/1810.12885)
+  - BoolQ [Clark et al., 2019](https://arxiv.org/abs/1905.10044)
 
 ## All T5 checkpoints
 