lewtun HF staff commited on
Commit
af2314d
1 Parent(s): 46c9910

Add evaluation results on the Blaise-g--PubMed_summ config and test split of Blaise-g/PubMed_summ

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the Blaise-g--PubMed_summ config and test split of the [Blaise-g/PubMed_summ](https://huggingface.co/datasets/Blaise-g/PubMed_summ) dataset by

@pszemraj

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-Blaise-g__PubMed_summ-Blaise-g__PubMed_summ-0234b8-1465653969).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=Blaise-g/PubMed_summ).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=Blaise-g/PubMed_summ).

Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -284,6 +284,39 @@ model-index:
284
  type: gen_len
285
  value: 239.4179
286
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
287
  ---
288
  # pszemraj/long-t5-tglobal-base-16384-booksum-V12
289
 
 
284
  type: gen_len
285
  value: 239.4179
286
  verified: true
287
+ - task:
288
+ type: summarization
289
+ name: Summarization
290
+ dataset:
291
+ name: Blaise-g/PubMed_summ
292
+ type: Blaise-g/PubMed_summ
293
+ config: Blaise-g--PubMed_summ
294
+ split: test
295
+ metrics:
296
+ - name: ROUGE-1
297
+ type: rouge
298
+ value: 32.7372
299
+ verified: true
300
+ - name: ROUGE-2
301
+ type: rouge
302
+ value: 7.215
303
+ verified: true
304
+ - name: ROUGE-L
305
+ type: rouge
306
+ value: 17.4859
307
+ verified: true
308
+ - name: ROUGE-LSUM
309
+ type: rouge
310
+ value: 29.1436
311
+ verified: true
312
+ - name: loss
313
+ type: loss
314
+ value: 2.3613107204437256
315
+ verified: true
316
+ - name: gen_len
317
+ type: gen_len
318
+ value: 146.2036
319
+ verified: true
320
  ---
321
  # pszemraj/long-t5-tglobal-base-16384-booksum-V12
322