metadata
language: en
tags:
  - summarization
model-index:
  - name: SamuelAllen123/t5-efficient-large-nl36_fine_tune_sum_V2
    results:
      - task:
          type: summarization
          name: Summarization
        dataset:
          name: samsum
          type: samsum
          config: samsum
          split: test
        metrics:
          - name: ROUGE-1
            type: rouge
            value: 50.4987
            verified: true
          - name: ROUGE-2
            type: rouge
            value: 25.6888
            verified: true
          - name: ROUGE-L
            type: rouge
            value: 41.7283
            verified: true
          - name: ROUGE-LSUM
            type: rouge
            value: 46.2626
            verified: true
          - name: loss
            type: loss
            value: 1.5158178806304932
            verified: true
          - name: gen_len
            type: gen_len
            value: 24.0342
            verified: true
      - task:
          type: summarization
          name: Summarization
        dataset:
          name: cnn_dailymail
          type: cnn_dailymail
          config: 3.0.0
          split: test
        metrics:
          - name: ROUGE-1
            type: rouge
            value: 34.4055
            verified: true
          - name: ROUGE-2
            type: rouge
            value: 14.127
            verified: true
          - name: ROUGE-L
            type: rouge
            value: 24.3353
            verified: true
          - name: ROUGE-LSUM
            type: rouge
            value: 31.6582
            verified: true
          - name: loss
            type: loss
            value: 2.4456119537353516
            verified: true
          - name: gen_len
            type: gen_len
            value: 45.928
            verified: true

Details coming soon
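
Until fuller details are available, the verified metrics in the metadata can be compared side by side. The snippet below is a minimal sketch: the numbers are copied from the `model-index` block above, while the `RESULTS` dictionary layout and the helper names `score_gap` and `format_table` are assumptions made for illustration and are not part of any Hub API.

```python
# Verified metric values copied from the model-index metadata of this card.
# The dictionary layout and helper names are hypothetical, for illustration only.
RESULTS = {
    "samsum": {
        "ROUGE-1": 50.4987,
        "ROUGE-2": 25.6888,
        "ROUGE-L": 41.7283,
        "ROUGE-LSUM": 46.2626,
        "loss": 1.5158178806304932,
        "gen_len": 24.0342,
    },
    "cnn_dailymail": {
        "ROUGE-1": 34.4055,
        "ROUGE-2": 14.127,
        "ROUGE-L": 24.3353,
        "ROUGE-LSUM": 31.6582,
        "loss": 2.4456119537353516,
        "gen_len": 45.928,
    },
}


def score_gap(results, metric, first="samsum", second="cnn_dailymail"):
    """Return results[first][metric] minus results[second][metric]."""
    return results[first][metric] - results[second][metric]


def format_table(results):
    """Render one row per metric and one column per dataset as plain text."""
    datasets = list(results)
    metrics = list(results[datasets[0]])
    lines = ["metric".ljust(12) + "".join(d.rjust(16) for d in datasets)]
    for m in metrics:
        row = "".join(f"{results[d][m]:16.4f}" for d in datasets)
        lines.append(m.ljust(12) + row)
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_table(RESULTS))
    print(f"ROUGE-1 gap (samsum - cnn_dailymail): {score_gap(RESULTS, 'ROUGE-1'):.4f}")
```

The two test sets differ in style and reference length, so the gap is best read as a rough indication of how the scores vary across datasets rather than as a like-for-like comparison.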