autoevaluator HF staff committed on
Commit
c5fffe5
1 Parent(s): c260988

Add evaluation results on the 3.0.0 config and test split of cnn_dailymail

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the 3.0.0 config and test split of the [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset by @samuelallen123, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-cnn_dailymail-3.0.0-bf74a8-1524255094).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=cnn_dailymail).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=cnn_dailymail).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -38,6 +38,39 @@ model-index:
       type: gen_len
       value: 24.0342
       verified: true
+  - task:
+      type: summarization
+      name: Summarization
+    dataset:
+      name: cnn_dailymail
+      type: cnn_dailymail
+      config: 3.0.0
+      split: test
+    metrics:
+    - name: ROUGE-1
+      type: rouge
+      value: 34.4055
+      verified: true
+    - name: ROUGE-2
+      type: rouge
+      value: 14.127
+      verified: true
+    - name: ROUGE-L
+      type: rouge
+      value: 24.3353
+      verified: true
+    - name: ROUGE-LSUM
+      type: rouge
+      value: 31.6582
+      verified: true
+    - name: loss
+      type: loss
+      value: 2.4456119537353516
+      verified: true
+    - name: gen_len
+      type: gen_len
+      value: 45.928
+      verified: true
 ---
 
 Details coming soon