tgoktug committed on
Commit
474a867
1 Parent(s): ac3db37

Training in progress epoch 0

Files changed (3)
  1. README.md +4 -8
  2. tf_model.h5 +1 -1
  3. tokenizer.json +1 -1
README.md CHANGED
@@ -15,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [facebook/blenderbot-400M-distill](https://huggingface.co/facebook/blenderbot-400M-distill) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 1.7632
-- Validation Loss: 1.8170
-- Epoch: 4
+- Train Loss: 1.5401
+- Validation Loss: 1.2105
+- Epoch: 0
 
 ## Model description
 
@@ -43,11 +43,7 @@ The following hyperparameters were used during training:
 
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 2.6342 | 2.1467 | 0 |
-| 2.1100 | 1.9767 | 1 |
-| 1.9563 | 1.8944 | 2 |
-| 1.8472 | 1.8479 | 3 |
-| 1.7632 | 1.8170 | 4 |
+| 1.5401 | 1.2105 | 0 |
 
 
 ### Framework versions
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b6585dca286fd0ee66ff493e2f76953d9b2f776862193c085a523043a8b8dc7c
+oid sha256:a51f5226e26fb2c60d400f61b626db3289f658b75a63dc5c2c1a6ffeda1821ac
 size 1459650480
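
Note on the tf_model.h5 hunk: the file is tracked with Git LFS, so the diff covers only the three-line pointer, not the weights themselves. The sketch below is one way to confirm which pointer fields changed between the two revisions. It assumes nothing beyond the pointer text shown in this diff; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the old and new sides of this commit.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b6585dca286fd0ee66ff493e2f76953d9b2f776862193c085a523043a8b8dc7c
size 1459650480
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:a51f5226e26fb2c60d400f61b626db3289f658b75a63dc5c2c1a6ffeda1821ac
size 1459650480
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Names of the fields whose values differ between the two revisions.
changed = sorted(key for key in old if old[key] != new.get(key))
print(changed)
```

Only the `oid` differs while `size` is identical. That fits a re-saved checkpoint with the same architecture and different weight values, although the pointer alone cannot confirm this.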
tokenizer.json CHANGED
@@ -2,7 +2,7 @@
 "version": "1.0",
 "truncation": {
 "direction": "Right",
-"max_length": 256,
+"max_length": 128,
 "strategy": "LongestFirst",
 "stride": 0
 },
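
Note on the tokenizer.json hunk: the truncation limit drops from 256 to 128 tokens. The sketch below shows what that means for a single sequence truncated on the right. It is a simplified, stdlib-only illustration, not the `tokenizers` library's implementation: `truncate_right` is a hypothetical helper, the 200-element list stands in for real token ids, and the sketch does not model sequence pairs (where `LongestFirst` applies), special tokens, or `stride`.

```python
import json

# The "truncation" object from the new side of the diff.
TRUNCATION_JSON = """
{
  "direction": "Right",
  "max_length": 128,
  "strategy": "LongestFirst",
  "stride": 0
}
"""


def truncate_right(token_ids, max_length):
    """Keep at most max_length ids, dropping any overflow from the right end."""
    return token_ids[:max_length]


config = json.loads(TRUNCATION_JSON)

ids = list(range(200))  # hypothetical stand-in for a 200-token input
kept_new = truncate_right(ids, config["max_length"])  # new limit: 128
kept_old = truncate_right(ids, 256)                   # previous limit: 256

print(len(kept_old), len(kept_new))
```

In this simplified model, a 200-token input fit entirely under the previous limit and loses its last 72 tokens under the new one.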