File size: 13,416 Bytes
f4d93a6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 |
[hops] 2024-09-23 22:03:10.261 | INFO | Initializing a parser from /workspace/configs/exp_camembertv2/camembertv2_base_p2_17k_last_layer.yaml
[hops] 2024-09-23 22:03:10.554 | INFO | Generating a FastText model from the treebank
[hops] 2024-09-23 22:03:10.645 | INFO | Training fasttext model
[hops] 2024-09-23 22:03:12.421 | WARNING | Some weights of RobertaModel were not initialized from the model checkpoint at /scratch/camembertv2/runs/models/camembertv2-base-bf16/post/ckpt-p2-17000/pt/ and are newly initialized: ['roberta.pooler.dense.bias', 'roberta.pooler.dense.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
[hops] 2024-09-23 22:03:24.938 | INFO | Start training on cuda:0
[hops] 2024-09-23 22:03:24.944 | WARNING | You're using a RobertaTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
[hops] 2024-09-23 22:04:54.185 | INFO | Epoch 0: train loss 1.1740 dev loss 0.3235 dev tag acc 93.16% dev head acc 88.90% dev deprel acc 91.95%
[hops] 2024-09-23 22:04:54.186 | INFO | New best model: head accuracy 88.90% > 0.00%
[hops] 2024-09-23 22:06:23.342 | INFO | Epoch 1: train loss 0.2838 dev loss 0.1626 dev tag acc 97.85% dev head acc 94.35% dev deprel acc 95.43%
[hops] 2024-09-23 22:06:23.343 | INFO | New best model: head accuracy 94.35% > 88.90%
[hops] 2024-09-23 22:07:57.103 | INFO | Epoch 2: train loss 0.1673 dev loss 0.1337 dev tag acc 98.24% dev head acc 95.40% dev deprel acc 96.71%
[hops] 2024-09-23 22:07:57.104 | INFO | New best model: head accuracy 95.40% > 94.35%
[hops] 2024-09-23 22:09:24.832 | INFO | Epoch 3: train loss 0.1227 dev loss 0.1438 dev tag acc 98.43% dev head acc 95.90% dev deprel acc 96.81%
[hops] 2024-09-23 22:09:24.833 | INFO | New best model: head accuracy 95.90% > 95.40%
[hops] 2024-09-23 22:10:55.835 | INFO | Epoch 4: train loss 0.0968 dev loss 0.1418 dev tag acc 98.43% dev head acc 96.14% dev deprel acc 97.20%
[hops] 2024-09-23 22:10:55.836 | INFO | New best model: head accuracy 96.14% > 95.90%
[hops] 2024-09-23 22:12:27.472 | INFO | Epoch 5: train loss 0.0793 dev loss 0.1568 dev tag acc 98.52% dev head acc 96.19% dev deprel acc 97.34%
[hops] 2024-09-23 22:12:27.473 | INFO | New best model: head accuracy 96.19% > 96.14%
[hops] 2024-09-23 22:13:56.231 | INFO | Epoch 6: train loss 0.0667 dev loss 0.1516 dev tag acc 98.57% dev head acc 96.40% dev deprel acc 97.36%
[hops] 2024-09-23 22:13:56.232 | INFO | New best model: head accuracy 96.40% > 96.19%
[hops] 2024-09-23 22:15:26.147 | INFO | Epoch 7: train loss 0.0566 dev loss 0.1687 dev tag acc 98.58% dev head acc 96.52% dev deprel acc 97.52%
[hops] 2024-09-23 22:15:26.148 | INFO | New best model: head accuracy 96.52% > 96.40%
[hops] 2024-09-23 22:16:55.419 | INFO | Epoch 8: train loss 0.0500 dev loss 0.1826 dev tag acc 98.64% dev head acc 96.53% dev deprel acc 97.42%
[hops] 2024-09-23 22:16:55.420 | INFO | New best model: head accuracy 96.53% > 96.52%
[hops] 2024-09-23 22:18:28.485 | INFO | Epoch 9: train loss 0.0425 dev loss 0.1906 dev tag acc 98.59% dev head acc 96.56% dev deprel acc 97.56%
[hops] 2024-09-23 22:18:28.486 | INFO | New best model: head accuracy 96.56% > 96.53%
[hops] 2024-09-23 22:20:02.390 | INFO | Epoch 10: train loss 0.0377 dev loss 0.2151 dev tag acc 98.55% dev head acc 96.60% dev deprel acc 97.51%
[hops] 2024-09-23 22:20:02.390 | INFO | New best model: head accuracy 96.60% > 96.56%
[hops] 2024-09-23 22:21:31.071 | INFO | Epoch 11: train loss 0.0332 dev loss 0.2276 dev tag acc 98.59% dev head acc 96.62% dev deprel acc 97.53%
[hops] 2024-09-23 22:21:31.072 | INFO | New best model: head accuracy 96.62% > 96.60%
[hops] 2024-09-23 22:22:58.744 | INFO | Epoch 12: train loss 0.0299 dev loss 0.2397 dev tag acc 98.59% dev head acc 96.62% dev deprel acc 97.49%
[hops] 2024-09-23 22:22:58.745 | INFO | New best model: head accuracy 96.62% > 96.62%
[hops] 2024-09-23 22:24:30.195 | INFO | Epoch 13: train loss 0.0270 dev loss 0.2548 dev tag acc 98.64% dev head acc 96.45% dev deprel acc 97.61%
[hops] 2024-09-23 22:25:58.937 | INFO | Epoch 14: train loss 0.0247 dev loss 0.2351 dev tag acc 98.69% dev head acc 96.51% dev deprel acc 97.60%
[hops] 2024-09-23 22:27:26.485 | INFO | Epoch 15: train loss 0.0219 dev loss 0.2812 dev tag acc 98.64% dev head acc 96.60% dev deprel acc 97.63%
[hops] 2024-09-23 22:28:53.871 | INFO | Epoch 16: train loss 0.0204 dev loss 0.2771 dev tag acc 98.64% dev head acc 96.70% dev deprel acc 97.59%
[hops] 2024-09-23 22:28:53.872 | INFO | New best model: head accuracy 96.70% > 96.62%
[hops] 2024-09-23 22:30:22.009 | INFO | Epoch 17: train loss 0.0193 dev loss 0.2966 dev tag acc 98.57% dev head acc 96.71% dev deprel acc 97.54%
[hops] 2024-09-23 22:30:22.010 | INFO | New best model: head accuracy 96.71% > 96.70%
[hops] 2024-09-23 22:31:50.178 | INFO | Epoch 18: train loss 0.0172 dev loss 0.3181 dev tag acc 98.65% dev head acc 96.63% dev deprel acc 97.61%
[hops] 2024-09-23 22:33:18.205 | INFO | Epoch 19: train loss 0.0163 dev loss 0.3030 dev tag acc 98.66% dev head acc 96.73% dev deprel acc 97.62%
[hops] 2024-09-23 22:33:18.206 | INFO | New best model: head accuracy 96.73% > 96.71%
[hops] 2024-09-23 22:34:52.436 | INFO | Epoch 20: train loss 0.0150 dev loss 0.3732 dev tag acc 98.64% dev head acc 96.74% dev deprel acc 97.44%
[hops] 2024-09-23 22:34:52.437 | INFO | New best model: head accuracy 96.74% > 96.73%
[hops] 2024-09-23 22:36:26.028 | INFO | Epoch 21: train loss 0.0139 dev loss 0.3404 dev tag acc 98.59% dev head acc 96.74% dev deprel acc 97.57%
[hops] 2024-09-23 22:36:26.029 | INFO | New best model: head accuracy 96.74% > 96.74%
[hops] 2024-09-23 22:37:56.614 | INFO | Epoch 22: train loss 0.0130 dev loss 0.3795 dev tag acc 98.66% dev head acc 96.59% dev deprel acc 97.59%
[hops] 2024-09-23 22:39:24.997 | INFO | Epoch 23: train loss 0.0120 dev loss 0.3572 dev tag acc 98.70% dev head acc 96.67% dev deprel acc 97.71%
[hops] 2024-09-23 22:40:54.945 | INFO | Epoch 24: train loss 0.0114 dev loss 0.3795 dev tag acc 98.65% dev head acc 96.71% dev deprel acc 97.69%
[hops] 2024-09-23 22:42:25.287 | INFO | Epoch 25: train loss 0.0113 dev loss 0.3792 dev tag acc 98.57% dev head acc 96.60% dev deprel acc 97.59%
[hops] 2024-09-23 22:43:52.396 | INFO | Epoch 26: train loss 0.0105 dev loss 0.3807 dev tag acc 98.69% dev head acc 96.61% dev deprel acc 97.63%
[hops] 2024-09-23 22:45:20.429 | INFO | Epoch 27: train loss 0.0093 dev loss 0.4159 dev tag acc 98.66% dev head acc 96.71% dev deprel acc 97.65%
[hops] 2024-09-23 22:46:51.804 | INFO | Epoch 28: train loss 0.0088 dev loss 0.4024 dev tag acc 98.56% dev head acc 96.68% dev deprel acc 97.59%
[hops] 2024-09-23 22:48:21.306 | INFO | Epoch 29: train loss 0.0084 dev loss 0.4070 dev tag acc 98.58% dev head acc 96.69% dev deprel acc 97.66%
[hops] 2024-09-23 22:49:52.685 | INFO | Epoch 30: train loss 0.0085 dev loss 0.4418 dev tag acc 98.58% dev head acc 96.70% dev deprel acc 97.64%
[hops] 2024-09-23 22:51:21.719 | INFO | Epoch 31: train loss 0.0077 dev loss 0.4297 dev tag acc 98.62% dev head acc 96.67% dev deprel acc 97.66%
[hops] 2024-09-23 22:52:56.380 | INFO | Epoch 32: train loss 0.0070 dev loss 0.4392 dev tag acc 98.63% dev head acc 96.63% dev deprel acc 97.71%
[hops] 2024-09-23 22:54:24.344 | INFO | Epoch 33: train loss 0.0065 dev loss 0.5069 dev tag acc 98.69% dev head acc 96.65% dev deprel acc 97.61%
[hops] 2024-09-23 22:55:56.289 | INFO | Epoch 34: train loss 0.0066 dev loss 0.4738 dev tag acc 98.64% dev head acc 96.57% dev deprel acc 97.58%
[hops] 2024-09-23 22:57:26.001 | INFO | Epoch 35: train loss 0.0059 dev loss 0.4935 dev tag acc 98.60% dev head acc 96.62% dev deprel acc 97.57%
[hops] 2024-09-23 22:58:52.412 | INFO | Epoch 36: train loss 0.0056 dev loss 0.5007 dev tag acc 98.65% dev head acc 96.57% dev deprel acc 97.55%
[hops] 2024-09-23 23:00:21.973 | INFO | Epoch 37: train loss 0.0053 dev loss 0.5094 dev tag acc 98.60% dev head acc 96.71% dev deprel acc 97.54%
[hops] 2024-09-23 23:01:50.675 | INFO | Epoch 38: train loss 0.0051 dev loss 0.4747 dev tag acc 98.61% dev head acc 96.73% dev deprel acc 97.57%
[hops] 2024-09-23 23:03:21.971 | INFO | Epoch 39: train loss 0.0048 dev loss 0.5596 dev tag acc 98.65% dev head acc 96.73% dev deprel acc 97.65%
[hops] 2024-09-23 23:04:50.664 | INFO | Epoch 40: train loss 0.0043 dev loss 0.4880 dev tag acc 98.67% dev head acc 96.79% dev deprel acc 97.69%
[hops] 2024-09-23 23:04:50.665 | INFO | New best model: head accuracy 96.79% > 96.74%
[hops] 2024-09-23 23:06:18.898 | INFO | Epoch 41: train loss 0.0041 dev loss 0.5152 dev tag acc 98.69% dev head acc 96.68% dev deprel acc 97.65%
[hops] 2024-09-23 23:07:48.810 | INFO | Epoch 42: train loss 0.0042 dev loss 0.5796 dev tag acc 98.62% dev head acc 96.77% dev deprel acc 97.59%
[hops] 2024-09-23 23:09:19.338 | INFO | Epoch 43: train loss 0.0039 dev loss 0.5478 dev tag acc 98.66% dev head acc 96.69% dev deprel acc 97.69%
[hops] 2024-09-23 23:10:49.453 | INFO | Epoch 44: train loss 0.0034 dev loss 0.5761 dev tag acc 98.66% dev head acc 96.71% dev deprel acc 97.64%
[hops] 2024-09-23 23:12:20.508 | INFO | Epoch 45: train loss 0.0035 dev loss 0.5968 dev tag acc 98.64% dev head acc 96.75% dev deprel acc 97.63%
[hops] 2024-09-23 23:13:49.566 | INFO | Epoch 46: train loss 0.0032 dev loss 0.5657 dev tag acc 98.64% dev head acc 96.78% dev deprel acc 97.69%
[hops] 2024-09-23 23:15:19.300 | INFO | Epoch 47: train loss 0.0029 dev loss 0.6033 dev tag acc 98.67% dev head acc 96.72% dev deprel acc 97.68%
[hops] 2024-09-23 23:16:46.854 | INFO | Epoch 48: train loss 0.0029 dev loss 0.6110 dev tag acc 98.67% dev head acc 96.72% dev deprel acc 97.68%
[hops] 2024-09-23 23:18:14.068 | INFO | Epoch 49: train loss 0.0026 dev loss 0.6084 dev tag acc 98.68% dev head acc 96.74% dev deprel acc 97.67%
[hops] 2024-09-23 23:19:44.122 | INFO | Epoch 50: train loss 0.0025 dev loss 0.6095 dev tag acc 98.62% dev head acc 96.76% dev deprel acc 97.68%
[hops] 2024-09-23 23:21:10.264 | INFO | Epoch 51: train loss 0.0025 dev loss 0.6551 dev tag acc 98.69% dev head acc 96.71% dev deprel acc 97.73%
[hops] 2024-09-23 23:22:41.212 | INFO | Epoch 52: train loss 0.0022 dev loss 0.6374 dev tag acc 98.62% dev head acc 96.70% dev deprel acc 97.64%
[hops] 2024-09-23 23:24:09.182 | INFO | Epoch 53: train loss 0.0021 dev loss 0.6473 dev tag acc 98.64% dev head acc 96.72% dev deprel acc 97.64%
[hops] 2024-09-23 23:25:37.902 | INFO | Epoch 54: train loss 0.0019 dev loss 0.6793 dev tag acc 98.66% dev head acc 96.73% dev deprel acc 97.67%
[hops] 2024-09-23 23:27:06.796 | INFO | Epoch 55: train loss 0.0019 dev loss 0.6544 dev tag acc 98.66% dev head acc 96.76% dev deprel acc 97.70%
[hops] 2024-09-23 23:28:34.813 | INFO | Epoch 56: train loss 0.0016 dev loss 0.7122 dev tag acc 98.66% dev head acc 96.69% dev deprel acc 97.67%
[hops] 2024-09-23 23:30:01.304 | INFO | Epoch 57: train loss 0.0015 dev loss 0.7413 dev tag acc 98.65% dev head acc 96.69% dev deprel acc 97.68%
[hops] 2024-09-23 23:31:30.980 | INFO | Epoch 58: train loss 0.0015 dev loss 0.7386 dev tag acc 98.66% dev head acc 96.71% dev deprel acc 97.68%
[hops] 2024-09-23 23:33:00.851 | INFO | Epoch 59: train loss 0.0014 dev loss 0.7433 dev tag acc 98.65% dev head acc 96.80% dev deprel acc 97.68%
[hops] 2024-09-23 23:33:00.852 | INFO | New best model: head accuracy 96.80% > 96.79%
[hops] 2024-09-23 23:34:32.878 | INFO | Epoch 60: train loss 0.0014 dev loss 0.7406 dev tag acc 98.64% dev head acc 96.76% dev deprel acc 97.67%
[hops] 2024-09-23 23:36:01.771 | INFO | Epoch 61: train loss 0.0014 dev loss 0.7765 dev tag acc 98.63% dev head acc 96.75% dev deprel acc 97.65%
[hops] 2024-09-23 23:37:32.476 | INFO | Epoch 62: train loss 0.0012 dev loss 0.7706 dev tag acc 98.64% dev head acc 96.73% dev deprel acc 97.67%
[hops] 2024-09-23 23:38:59.167 | INFO | Epoch 63: train loss 0.0012 dev loss 0.7670 dev tag acc 98.64% dev head acc 96.74% dev deprel acc 97.67%
[hops] 2024-09-23 23:39:04.307 | WARNING | You're using a RobertaTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
[hops] 2024-09-23 23:39:12.221 | WARNING | You're using a RobertaTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
[hops] 2024-09-23 23:39:13.462 | INFO | Metrics for GSD-camembertv2_base_p2_17k_last_layer+rand_seed=42
βββββββββββββββββββββββββββββββ
Split UPOS UAS LAS
βββββββββββββββββββββββββββββββ
Dev 98.65 96.81 95.66
Test 98.66 95.77 94.32
βββββββββββββββββββββββββββββββ
|