Training complete

Browse files

Files changed (4) hide show

README.md +68 -0
generation_config.json +6 -0
runs/Jul24_07-54-54_479fe7a920df/events.out.tfevents.1721807717.479fe7a920df.237.1 +2 -2
runs/Jul24_07-54-54_479fe7a920df/events.out.tfevents.1721810464.479fe7a920df.237.2 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,68 @@

+---
+license: apache-2.0
+base_model: t5-small
+tags:
+- summarization
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: t5-small-finetuned-cnn-news
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# t5-small-finetuned-cnn-news
+This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.8421
+- Rouge1: 24.4309
+- Rouge2: 12.1268
+- Rougel: 20.3697
+- Rougelsum: 23.18
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.00056
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 5
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 2.0267        | 1.0   | 718  | 1.8134          | 24.5086 | 12.0372 | 20.3241 | 23.2338   |
+| 1.8289        | 2.0   | 1436 | 1.8150          | 24.4861 | 12.1833 | 20.5262 | 23.3358   |
+| 1.6833        | 3.0   | 2154 | 1.8148          | 23.9202 | 11.7941 | 19.9514 | 22.7185   |
+| 1.576         | 4.0   | 2872 | 1.8271          | 24.2367 | 11.8778 | 20.1292 | 23.0104   |
+| 1.4965        | 5.0   | 3590 | 1.8421          | 24.4309 | 12.1268 | 20.3697 | 23.18     |
+### Framework versions
+- Transformers 4.42.4
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.42.4"
+}

runs/Jul24_07-54-54_479fe7a920df/events.out.tfevents.1721807717.479fe7a920df.237.1 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:845311fb906250aaeee766218ba3a3f23da297385da0a2041a5215421cfca626
-size 8801

 version https://git-lfs.github.com/spec/v1
+oid sha256:aa3fada5c1d82af8095277b74ad2c0bde914c4d032c292960d8e9c1813eefd1e
+size 9629

runs/Jul24_07-54-54_479fe7a920df/events.out.tfevents.1721810464.479fe7a920df.237.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d07567b836a38208dcc1d78cb749827494f7e78512d681eb40c9918704db7a3a
+size 562