eddieman78 commited on
Commit
f205877
1 Parent(s): c2d2ab8
Files changed (3) hide show
  1. README.md +9 -6
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
- base_model: google/flan-t5-base
6
  model-index:
7
  - name: args-mem-base
8
  results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 38.5896
19
 
20
  ## Model description
21
 
@@ -40,13 +40,16 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - training_steps: 10
44
 
45
  ### Training results
46
 
47
- | Training Loss | Epoch | Step | Validation Loss |
48
- |:-------------:|:-----:|:----:|:---------------:|
49
- | 38.5866 | 0.0 | 10 | 38.5896 |
 
 
 
50
 
51
 
52
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: google/flan-t5-base
4
  tags:
5
  - generated_from_trainer
 
6
  model-index:
7
  - name: args-mem-base
8
  results: []
 
15
 
16
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.0044
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 4
44
 
45
  ### Training results
46
 
47
+ | Training Loss | Epoch | Step | Validation Loss |
48
+ |:-------------:|:-----:|:-----:|:---------------:|
49
+ | 0.0094 | 1.0 | 13584 | 0.0058 |
50
+ | 0.0069 | 2.0 | 27168 | 0.0049 |
51
+ | 0.0062 | 3.0 | 40752 | 0.0044 |
52
+ | 0.0056 | 4.0 | 54336 | 0.0044 |
53
 
54
 
55
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fce2079d1ac6313aca32b2963bdd82f75f6f30ac4332ef454fa2ecd71782f336
3
  size 990345064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c1213b4b9175d40ea9784d400ac9d3522cfb1816fce4e431752ef0b3ae9bcb3
3
  size 990345064
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:73a811086531b4fe2a68f23fb03b3abbb416427bc3376fe71b98740953677a70
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:66d5b7d5f432b4969609bac67eb1228854773ab6347eba38c5bf5860ae047f92
3
  size 5048