DUNHILL committed on
Commit
999248f
1 Parent(s): 943b2f4

Model save

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [dandelin/vilt-b32-mlm](https://huggingface.co/dandelin/vilt-b32-mlm) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4644
+- Loss: 38.8391
 
 ## Model description
 
@@ -35,21 +35,22 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
+- learning_rate: 3e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 4
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.4691 | 1.0 | 250 | 0.3940 |
-| 0.3872 | 2.0 | 500 | 0.4159 |
-| 0.3803 | 3.0 | 750 | 0.4644 |
+| No log | 1.0 | 38 | 118.6701 |
+| 181.5151 | 2.0 | 76 | 62.2544 |
+| 64.466 | 3.0 | 114 | 43.5740 |
+| 40.2832 | 4.0 | 152 | 38.8391 |
 
 
 ### Framework versions
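
Review note: the step counts in the training results, together with `train_batch_size: 8`, hint at how much data each run saw, and `lr_scheduler_type: linear` determines the learning rate at each logged step. The Python sketch below works through that arithmetic. It assumes a single device, no gradient accumulation, a final partial batch that is kept, and zero warmup steps. None of these are stated in the card, and the helper names `sample_range` and `linear_lr` are hypothetical.

```python
def sample_range(steps_per_epoch: int, batch_size: int) -> tuple[int, int]:
    """Smallest and largest training-set sizes that give this many steps per epoch.

    Assumes steps_per_epoch == ceil(n_samples / batch_size).
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


def linear_lr(step: int, total_steps: int, base_lr: float) -> float:
    """Learning rate under linear decay to zero, with no warmup (an assumption)."""
    remaining = max(0, total_steps - step)
    return base_lr * remaining / total_steps


# Values taken from the diff: 250 steps per epoch before, 38 after.
print(sample_range(250, 8))  # (1993, 2000)
print(sample_range(38, 8))   # (297, 304)

# Learning rate at each evaluation step of the new run (3e-05, 152 total steps).
for step in (38, 76, 114, 152):
    print(step, linear_lr(step, 152, 3e-05))
```

Under these assumptions the new run saw roughly 300 training examples per epoch, against roughly 2,000 before, so the two reported losses may come from different data and are probably not directly comparable.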
runs/Oct18_10-47-34_4138cf165407/events.out.tfevents.1729248459.4138cf165407.464.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:adc01edf7acab2074cffc431f13c49889c5f929269e3aef241bf1552c0dadff1
-size 76180
+oid sha256:78288500641c367a349e317dc88fdc9c433900d82c6cbd548cae1381bc627a25
+size 76805
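
Review note: the TensorBoard event file is stored through Git LFS, so this hunk only changes the pointer (hash and size), not the log contents. As a minimal sketch, the pointer fields can be parsed and checked against a downloaded blob in Python. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the parser handles only the three keys shown in this diff.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `version`, `oid`, and `size` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(blob: bytes, pointer: dict) -> bool:
    """Check a blob's length and SHA-256 digest against a parsed pointer."""
    return (
        len(blob) == pointer["size"]
        and hashlib.sha256(blob).hexdigest() == pointer["digest"]
    )


# New pointer contents, copied from the diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:78288500641c367a349e317dc88fdc9c433900d82c6cbd548cae1381bc627a25
size 76805
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["size"] - 76180)  # bytes added relative to the old pointer: 625
```

The 625-byte growth is consistent with extra events being appended to the same log file, though the diff alone does not show what was logged.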