Training complete

Browse files

Files changed (5) hide show

README.md +70 -0
generation_config.json +6 -0
model.safetensors +1 -1
runs/Jul28_19-52-03_04efe8f423c6/events.out.tfevents.1722196351.04efe8f423c6.4243.1 +2 -2
runs/Jul28_19-52-03_04efe8f423c6/events.out.tfevents.1722196454.04efe8f423c6.4243.2 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,70 @@

+---
+license: apache-2.0
+base_model: google/mt5-small
+tags:
+- summarization
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: mt5-small-finetuned-amazon-en-es
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# mt5-small-finetuned-amazon-en-es
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 3.8946
+- Rouge1: 0.0918
+- Rouge2: 0.0167
+- Rougel: 0.0876
+- Rougelsum: 0.0856
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5.6e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
+| 19.0313       | 1.0   | 80   | 6.9781          | 0.0159 | 0.0    | 0.0143 | 0.0159    |
+| 12.6263       | 2.0   | 160  | 5.4246          | 0.0355 | 0.0    | 0.0230 | 0.0230    |
+| 9.6692        | 3.0   | 240  | 4.8658          | 0.0264 | 0.0    | 0.0202 | 0.0202    |
+| 7.5987        | 4.0   | 320  | 4.4427          | 0.0624 | 0.0333 | 0.0613 | 0.0605    |
+| 6.6653        | 5.0   | 400  | 4.1524          | 0.0644 | 0.0    | 0.0574 | 0.0579    |
+| 6.1087        | 6.0   | 480  | 4.0012          | 0.0990 | 0.0167 | 0.0943 | 0.0935    |
+| 5.8293        | 7.0   | 560  | 3.9214          | 0.0918 | 0.0167 | 0.0876 | 0.0856    |
+| 5.8101        | 8.0   | 640  | 3.8946          | 0.0918 | 0.0167 | 0.0876 | 0.0856    |
+### Framework versions
+- Transformers 4.42.4
+- Pytorch 2.3.1+cu121
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.42.4"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eb83b84cbd2faec79e1f83eb04a22e62f9d4fb57e1b7091e8aa25508e43cd529
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:feeb76a15243ca9566f412288f76a6470aea84fb7a7485a9f5bf6361bddae611
 size 1200729512

runs/Jul28_19-52-03_04efe8f423c6/events.out.tfevents.1722196351.04efe8f423c6.4243.1 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b154f057c8496a542c1e6caccbbfbaf6359ddb11c9f12c7b6ecc146ad092109b
-size 9261

 version https://git-lfs.github.com/spec/v1
+oid sha256:fc71709924fbf406e96661883cdadfc14e16e9453ab9a949db2d4c0a5454f41b
+size 10985

runs/Jul28_19-52-03_04efe8f423c6/events.out.tfevents.1722196454.04efe8f423c6.4243.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6c88f2c1773f9543d682b4138c00d785f5dcdc2d13f50df467915ab1c5d2ac77
+size 562