Dmitry Chaplinsky
commited on
Commit
•
7342af5
1
Parent(s):
70d77ae
Updated model: 577 splits, 20.61 epochs, min_loss: 1.0161, min_ppl: 2.7625
Browse files- best-lm.pt +1 -1
- loss.txt +13 -0
best-lm.pt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 22791455
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0db3bfafb4c929b9fbc8c135468a166c65b521a0c98d865bc89b82a160f80c81
|
3 |
size 22791455
|
loss.txt
CHANGED
@@ -562,3 +562,16 @@
|
|
562 |
| end of split 86 / 28 | epoch 18 | time: 3245.23s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
563 |
| end of split 87 / 28 | epoch 18 | time: 3244.78s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
564 |
| end of split 88 / 28 | epoch 18 | time: 3241.61s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
562 |
| end of split 86 / 28 | epoch 18 | time: 3245.23s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
563 |
| end of split 87 / 28 | epoch 18 | time: 3244.78s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
564 |
| end of split 88 / 28 | epoch 18 | time: 3241.61s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
565 |
+
| end of split 89 / 28 | epoch 18 | time: 3196.37s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
566 |
+
| end of split 90 / 28 | epoch 18 | time: 3215.04s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
567 |
+
| end of split 91 / 28 | epoch 18 | time: 3226.07s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
568 |
+
| end of split 92 / 28 | epoch 18 | time: 3222.68s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
569 |
+
| end of split 93 / 28 | epoch 18 | time: 3224.21s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
570 |
+
| end of split 94 / 28 | epoch 18 | time: 3225.28s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
571 |
+
| end of split 95 / 28 | epoch 18 | time: 3229.80s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
572 |
+
| end of split 96 / 28 | epoch 18 | time: 3226.91s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
573 |
+
| end of split 97 / 28 | epoch 18 | time: 3230.45s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
574 |
+
| end of split 98 / 28 | epoch 18 | time: 3238.09s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
575 |
+
| end of split 99 / 28 | epoch 18 | time: 3154.25s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
576 |
+
| end of split 100 / 28 | epoch 18 | time: 3070.87s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0049
|
577 |
+
| end of split 101 / 28 | epoch 18 | time: 3198.30s | valid loss 1.0161 | valid ppl 2.7624 | learning rate 0.0012
|