tilyupo commited on
Commit
1286a9d
1 Parent(s): 9dee6a7

batch_size=64

Browse files
Files changed (2) hide show
  1. README.md +9 -20
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -15,23 +15,9 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.4162
19
- - Epoch: 0
20
-
21
- <pre>
22
- WRONG!!! It is fine-tuned on validation data
23
-
24
- {'eval_loss': 0.3034636080265045,
25
- 'eval_bleu': 21.208712916114965,
26
- 'eval_rouge1': 57.0,
27
- 'eval_rouge2': 34.71,
28
- 'eval_rougeL': 50.2,
29
- 'eval_rougeLsum': 50.22,
30
- 'eval_exact': 0.03128947624137596,
31
- 'eval_runtime': 347.407,
32
- 'eval_samples_per_second': 29.622,
33
- 'eval_steps_per_second': 0.927}
34
- </pre>
35
 
36
  ## Model description
37
 
@@ -55,9 +41,12 @@ The following hyperparameters were used during training:
55
 
56
  ### Training results
57
 
58
- | Train Loss | Epoch |
59
- |:----------:|:-----:|
60
- | 0.4162 | 0 |
 
 
 
61
 
62
 
63
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.3095
19
+ - Validation Loss: 0.3903
20
+ - Epoch: 3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
 
22
  ## Model description
23
 
 
41
 
42
  ### Training results
43
 
44
+ | Train Loss | Validation Loss | Epoch |
45
+ |:----------:|:---------------:|:-----:|
46
+ | 0.6099 | 0.4054 | 0 |
47
+ | 0.3919 | 0.3899 | 1 |
48
+ | 0.3451 | 0.3880 | 2 |
49
+ | 0.3095 | 0.3903 | 3 |
50
 
51
 
52
  ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:510e4d4d93ebb2e2a6806c29348a9875a15cacbe0c0c7efa7554e609f4409706
3
  size 439831352
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8969cc622ef945ebc9df0f83c11e1250b64574e87d1096d5017b3a788bcb23c5
3
  size 439831352