veronica-girolimetti committed
Commit 15a0544
1 Parent(s): da3687e

End of training

README.md ADDED
@@ -0,0 +1,67 @@
+ ---
+ license: apache-2.0
+ base_model: google/flan-t5-small
+ tags:
+ - generated_from_trainer
+ metrics:
+ - rouge
+ model-index:
+ - name: test-dialogue-summarization-headers
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # test-dialogue-summarization-headers
+
+ This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.4536
+ - Rouge: {'rouge1': 44.8698, 'rouge2': 19.92, 'rougeL': 20.7147, 'rougeLsum': 20.7147}
+ - Bert Score: 0.8733
+ - Bleurt 20: -0.8548
+ - Gen Len: 15.305
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 7
+ - eval_batch_size: 7
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 5
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Rouge | Bert Score | Bleurt 20 | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:-------------------------------------------------------------------------------:|:----------:|:---------:|:-------:|
+ | 2.6545 | 1.0 | 186 | 2.5321 | {'rouge1': 45.8489, 'rouge2': 19.7993, 'rougeL': 20.5196, 'rougeLsum': 20.5196} | 0.8737 | -0.8571 | 15.16 |
+ | 2.6779 | 2.0 | 372 | 2.4884 | {'rouge1': 44.2284, 'rouge2': 19.6646, 'rougeL': 20.6804, 'rougeLsum': 20.6804} | 0.8737 | -0.8594 | 15.13 |
+ | 2.6701 | 3.0 | 558 | 2.4682 | {'rouge1': 44.6249, 'rouge2': 19.9539, 'rougeL': 20.6036, 'rougeLsum': 20.6036} | 0.8737 | -0.8576 | 15.25 |
+ | 2.597 | 4.0 | 744 | 2.4582 | {'rouge1': 45.0018, 'rouge2': 19.7794, 'rougeL': 20.647, 'rougeLsum': 20.647} | 0.8739 | -0.8582 | 15.295 |
+ | 2.5861 | 5.0 | 930 | 2.4536 | {'rouge1': 44.8698, 'rouge2': 19.92, 'rougeL': 20.7147, 'rougeLsum': 20.7147} | 0.8733 | -0.8548 | 15.305 |
+
+
+ ### Framework versions
+
+ - Transformers 4.35.2
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.16.1
+ - Tokenizers 0.15.0
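
Reviewer note: as a reading aid for the hyperparameters and the Step column in the README above, here is a minimal Python sketch of the schedule they imply. It assumes a single device, no gradient accumulation, no dropped final batch, and zero warmup steps; the model card states none of these. The helper names are hypothetical, and the decay formula is a plain linear ramp to zero written by hand, not a call into Transformers.

```python
# Values taken from the model card in this commit.
LEARNING_RATE = 2e-05
TRAIN_BATCH_SIZE = 7
NUM_EPOCHS = 5
STEPS_PER_EPOCH = 186  # Step column at epoch 1.0

# Assumptions NOT stated in the card: one device, no gradient
# accumulation, last partial batch kept, no warmup.
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def train_set_size_bounds(steps_per_epoch: int, batch_size: int) -> tuple[int, int]:
    """Range of dataset sizes N for which ceil(N / batch_size) == steps_per_epoch."""
    smallest = (steps_per_epoch - 1) * batch_size + 1
    largest = steps_per_epoch * batch_size
    return smallest, largest


def linear_lr(step: int, base_lr: float = LEARNING_RATE,
              total_steps: int = TOTAL_STEPS) -> float:
    """Learning rate after `step` optimizer updates under linear decay to zero."""
    remaining = max(0, total_steps - step)
    return base_lr * remaining / total_steps


if __name__ == "__main__":
    low, high = train_set_size_bounds(STEPS_PER_EPOCH, TRAIN_BATCH_SIZE)
    print(f"total optimizer steps: {TOTAL_STEPS}")
    print(f"implied training-set size: {low} to {high} examples")
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step}, lr {linear_lr(step):.2e}")
```

Under those assumptions the final Step value of 930 in the results table equals 186 steps per epoch times 5 epochs, and the learning rate reaches zero at that step.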
generation_config.json ADDED
@@ -0,0 +1,6 @@
+ {
+   "decoder_start_token_id": 0,
+   "eos_token_id": 1,
+   "pad_token_id": 0,
+   "transformers_version": "4.35.2"
+ }
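
Reviewer note: to illustrate what `decoder_start_token_id` and `pad_token_id` are used for, here is a minimal pure-Python sketch of the shift-right step that T5-style models apply to labels to build decoder inputs during training. The function name `shift_right` and the sample label ids are illustrative assumptions; the sketch mirrors the idea, not the Transformers source. It also assumes the common convention of masking ignored label positions with -100.

```python
import json

# The generation config added in this commit, reproduced as a string.
GENERATION_CONFIG = json.loads("""
{
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
  "transformers_version": "4.35.2"
}
""")

IGNORE_INDEX = -100  # assumed label-masking convention, not part of the config


def shift_right(labels, decoder_start_token_id, pad_token_id):
    """Build decoder inputs: prepend the start token, drop the last label,
    and replace masked positions with the pad token."""
    shifted = [decoder_start_token_id] + list(labels[:-1])
    return [pad_token_id if tok == IGNORE_INDEX else tok for tok in shifted]


if __name__ == "__main__":
    # Hypothetical label ids: two content tokens, EOS (1), then masked padding.
    labels = [8774, 55, GENERATION_CONFIG["eos_token_id"], IGNORE_INDEX, IGNORE_INDEX]
    decoder_inputs = shift_right(
        labels,
        GENERATION_CONFIG["decoder_start_token_id"],
        GENERATION_CONFIG["pad_token_id"],
    )
    print(decoder_inputs)
```

Because this config sets both `decoder_start_token_id` and `pad_token_id` to 0, the start token and the padding are indistinguishable by id in the decoder inputs; only their position tells them apart.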
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9f708a57c276fe056dc5c2013925ffb7787bac19c95e64b720a6186f688c1e1e
+ oid sha256:79e8e09918ffd5142f9bdec5923abb0bbe3de7caa5567280c5f602c385f5e30d
  size 307867048
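
Reviewer note: the `model.safetensors` change above edits a Git LFS pointer file rather than the weights inline. The following is a minimal sketch of reading such a pointer, assuming the three-line `key value` layout shown in this diff. `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer (lines of 'key value') into a small dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Old and new pointer contents, copied from the diff above.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9f708a57c276fe056dc5c2013925ffb7787bac19c95e64b720a6186f688c1e1e
size 307867048
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:79e8e09918ffd5142f9bdec5923abb0bbe3de7caa5567280c5f602c385f5e30d
size 307867048
"""

if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD_POINTER), parse_lfs_pointer(NEW_POINTER)
    print("content changed:", old["digest"] != new["digest"])
    print("size changed:", old["size_bytes"] != new["size_bytes"])
```

The digest differs while the byte size is identical, which is what one would expect when weights are updated in place and the tensor shapes and dtypes stay the same.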
runs/Jan05_22-47-01_e8fd42101a63/events.out.tfevents.1704494823.e8fd42101a63.1295.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d19553d7f61e67a70075c0c9147d7e1f97ceea6f043e86f6c068bc4a0063af9c
- size 7095
+ oid sha256:740f185bcff291934ea1ddf6b5aa82d19904cacbcfabe9b38ee2414a61ff5e00
+ size 9678
runs/Jan05_22-47-01_e8fd42101a63/events.out.tfevents.1704495384.e8fd42101a63.1295.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f5524ec18e14d32d472ce5a75cc24df2e90d7137ea2600de76d54632b50e014b
+ size 517