patrickvonplaten commited on
Commit
686f1db
1 Parent(s): 086bcba

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -12
README.md CHANGED
@@ -22,22 +22,33 @@ Thereby, the following datasets were being used for (1.) and (2.):
22
 
23
  1. **Datasets used for Unsupervised denoising objective**:
24
 
25
- - Pretraining Dataset: [C4](https://huggingface.co/datasets/c4)
 
 
26
 
27
  2. **Datasets used for Supervised text-to-text language modeling objective**
28
 
29
  - Sentence acceptability judgment
30
- - (CoLA (Warstadt et al., 2018))
31
- - Sentiment analysis (SST-2 (Socher et al., 2013))
32
- - Paraphrasing/sentence similarity (MRPC (Dolan and Brockett, 2005), STS-B (Cer
33
- et al., 2017), QQP (Iyer et al., 2017))
34
- - Natural language inference (MNLI (Williams et al., 2017), QNLI (Rajpurkar et al.,
35
- 2016), RTE (Dagan et al., 2005), CB (De Marneff et al., 2019))
36
- - Coreference resolution (WNLI and WSC (Levesque et al., 2012))
37
- - Sentence completion (COPA (Roemmele et al., 2011))
38
- - Word sense disambiguation (WIC (Pilehvar and Camacho-Collados, 2018))
39
- - Question answering (MultiRC (Khashabi et al., 2018), ReCoRD (Zhang et al., 2018),
40
- BoolQ (Clark et al., 2019))
 
 
 
 
 
 
 
 
 
41
 
42
  ## All T5 checkpoints
43
 
 
22
 
23
  1. **Datasets used for Unsupervised denoising objective**:
24
 
25
+ - [C4](https://huggingface.co/datasets/c4)
26
+ - [Wiki-DPR](https://huggingface.co/datasets/wiki_dpr)
27
+
28
 
29
  2. **Datasets used for Supervised text-to-text language modeling objective**
30
 
31
  - Sentence acceptability judgment
32
+ - CoLA [Warstadt et al., 2018](https://arxiv.org/abs/1805.12471)
33
+ - Sentiment analysis
34
+ - SST-2 [Socher et al., 2013](https://nlp.stanford.edu/~socherr/EMNLP2013_RNTN.pdf)
35
+ - Paraphrasing/sentence similarity
36
+ - MRPC [Dolan and Brockett, 2005](https://aclanthology.org/I05-5002)
37
+ - STS-B [Ceret al., 2017](https://arxiv.org/abs/1708.00055)
38
+ - QQP [Iyer et al., 2017](https://quoradata.quora.com/First-Quora-Dataset-Release-Question-Pairs)
39
+ - Natural language inference
40
+ - MNLI [Williams et al., 2017](https://arxiv.org/abs/1704.05426)
41
+ - QNLI [Rajpurkar et al.,2016](https://arxiv.org/abs/1606.05250)
42
+ - RTE [Dagan et al., 2005](https://link.springer.com/chapter/10.1007/11736790_9)
43
+ - CB [De Marneff et al., 2019](https://semanticsarchive.net/Archive/Tg3ZGI2M/Marneffe.pdf)
44
+ - Sentence completion
45
+ - COPA [Roemmele et al., 2011](https://www.researchgate.net/publication/221251392_Choice_of_Plausible_Alternatives_An_Evaluation_of_Commonsense_Causal_Reasoning)
46
+ - Word sense disambiguation
47
+ - WIC [Pilehvar and Camacho-Collados, 2018](https://arxiv.org/abs/1808.09121)
48
+ - Question answering
49
+ - MultiRC [Khashabi et al., 2018](https://aclanthology.org/N18-1023)
50
+ - ReCoRD [Zhang et al., 2018](https://arxiv.org/abs/1810.12885)
51
+ - BoolQ [Clark et al., 2019](https://arxiv.org/abs/1905.10044)
52
 
53
  ## All T5 checkpoints
54