autoevaluator HF staff commited on
Commit
dde7494
1 Parent(s): da8c5b2

Add evaluation results on the default config and test split of xsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [xsum](https://huggingface.co/datasets/xsum) dataset by

@pszemraj

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-xsum-default-98b05d-39746145061).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -23,25 +23,25 @@ model-index:
23
  split: test
24
  metrics:
25
  - type: rouge
26
- value: 39.3722
27
  name: ROUGE-1
28
  verified: true
29
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2YzZmMyZjk4OTdlMmJiMWY4NzZmNTMwMTFkZmFiNjE3ZjE2MWY2ODY3MGZkYjdiNzBkNDAyMmE2ODRmNzA3YyIsInZlcnNpb24iOjF9.6OEAAa9Hy9xxkAsXyyxylTPoerL8OMg2Cc_jP0KGA25ZQCF3LLZFMrSyi93AMoqBfRCcdaJ7ypE2b1m2YVNcAw
30
  - type: rouge
31
- value: 17.5791
32
  name: ROUGE-2
33
  verified: true
34
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2IwYjBiYTUzNjZlMjdjNjI5NmVjNGU3N2E0MzdhYmIxODI1NTM5N2I4MTcwMGIyNWRkZjhmM2UzZDY2ZWJkYSIsInZlcnNpb24iOjF9.p30-YNYekLdiVwNFGmm3-FJP0iBQgOpkmyxjM8xlNGRGc2iihhNAwe4ewRFYmkRSjnbrHjoGKERBq_-3DzE1Cw
35
  - type: rouge
36
- value: 32.6528
37
  name: ROUGE-L
38
  verified: true
39
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjg0ZDIyMDUwYzFiZjMwYzUxOTg4MDlmODcwMTdiNzFlNDQ4NDViYTgzNmMwYmRlODgzODM0MjdlMzU3NGU3MCIsInZlcnNpb24iOjF9.kIYuS58Tws-oEyuneymCC4by4yfI0zsq2uHOUYNfEHmGYdetsgFzzBuVmTHa37KRD30MuDk-RPCy-kam25CVCg
40
  - type: rouge
41
- value: 32.6438
42
  name: ROUGE-LSUM
43
  verified: true
44
- verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWJlMDlhOTQ4ZjI5ZWZkZTNmNzZmNGI0NzhkYmYwZGUxNDZkYzFiYTFmOGIxZTgxZWNiMGQ3YzY2NTUwYzhkYyIsInZlcnNpb24iOjF9.S4ioChbfEA1hDf4jnRBCTUk9oLX9bX82We6W4SzsLmT3BAYe1wBPgfpDvSUQ1PQhZk5IaDwfKtKF_z1coqBVDQ
45
  - type: loss
46
  value: 1.4964560270309448
47
  name: loss
 
23
  split: test
24
  metrics:
25
  - type: rouge
26
+ value: 39.3614
27
  name: ROUGE-1
28
  verified: true
29
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMWZmZDNhNWM5YjcyMzVjNjUwMWE1NDg4YmRiNGMwY2EyZDYzMGZkY2NlNWE0MzQwNDYzN2JkNzYyOGUxNmI3ZiIsInZlcnNpb24iOjF9.1ucBm8VOqZgLXmUyDkPisiFfHJ8VYvOdvUsk6R_F0QGLIBXOCf2s_pbqHauTyEQM2mAn762DpR5L4AZg7hF_BA
30
  - type: rouge
31
+ value: 17.5887
32
  name: ROUGE-2
33
  verified: true
34
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDU3MDQwNjYzMTE2MjU5NTE0ODU1ZmI2ZjhlY2QxODA3YTYyOWExZDdiM2Y4YzZhMTU3N2IwMGQ4M2MxMTNmZiIsInZlcnNpb24iOjF9.lb6R_xg5R1TABUCSRgvEGmdkxhSRavrfllxhsk_NxKA53EC4MXeE6o7nRWPoo2nrBOb5Lcajy_5y4oPOkv84Ag
35
  - type: rouge
36
+ value: 32.6489
37
  name: ROUGE-L
38
  verified: true
39
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZmFkOTc2MTIxMmYyNTY2MWE3Y2E4ZWYwODQ5MmU3NTIxZWM2Yzg2ZDNkYjE3NDgzM2VjYTMwOTkxNjQ1YmIyYiIsInZlcnNpb24iOjF9.AAAh5SnRDnTMCEXMfEp9N7pwHITv-crNloZTnbW7TMPXtMUe7vzATOxGVMZpMe-Nsf3Wkc3JbUdaZZ9bOb17Ag
40
  - type: rouge
41
+ value: 32.6435
42
  name: ROUGE-LSUM
43
  verified: true
44
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjg1ZmNkODZlMzdkODA4MDUxMGQyNjFiMTkyYjIzMTE2NGMyOWQ1NmQ2YjY0OTRmZjVjZWNhODBiOWI1YzVlOCIsInZlcnNpb24iOjF9.GUVl2J3DCRQUqueSuCsFM8v7IDXH7EATFlQbFl730Bo8Y2aolA-V9uN7pkaU9IM1wWBz7hvILElBCE0sln6SAQ
45
  - type: loss
46
  value: 1.4964560270309448
47
  name: loss