Marcos12886 committed on
Commit
be8b76f
1 Parent(s): 65c1586

Upload folder using huggingface_hub

Files changed (48)
  1. README.md +11 -14
  2. checkpoint-170/model.safetensors +1 -1
  3. checkpoint-170/optimizer.pt +1 -1
  4. checkpoint-170/scheduler.pt +1 -1
  5. checkpoint-170/trainer_state.json +19 -19
  6. checkpoint-170/training_args.bin +1 -1
  7. checkpoint-256/model.safetensors +1 -1
  8. checkpoint-256/optimizer.pt +1 -1
  9. checkpoint-256/scheduler.pt +1 -1
  10. checkpoint-256/trainer_state.json +27 -27
  11. checkpoint-256/training_args.bin +1 -1
  12. checkpoint-340/config.json +81 -0
  13. checkpoint-340/model.safetensors +3 -0
  14. checkpoint-340/optimizer.pt +3 -0
  15. checkpoint-340/rng_state.pth +3 -0
  16. checkpoint-340/scheduler.pt +3 -0
  17. checkpoint-340/trainer_state.json +90 -0
  18. checkpoint-340/training_args.bin +3 -0
  19. checkpoint-341/model.safetensors +1 -1
  20. checkpoint-341/optimizer.pt +1 -1
  21. checkpoint-341/scheduler.pt +1 -1
  22. checkpoint-341/trainer_state.json +33 -33
  23. checkpoint-341/training_args.bin +1 -1
  24. checkpoint-426/model.safetensors +1 -1
  25. checkpoint-426/optimizer.pt +1 -1
  26. checkpoint-426/scheduler.pt +1 -1
  27. checkpoint-426/trainer_state.json +43 -43
  28. checkpoint-426/training_args.bin +1 -1
  29. checkpoint-510/config.json +81 -0
  30. checkpoint-510/model.safetensors +3 -0
  31. checkpoint-510/optimizer.pt +3 -0
  32. checkpoint-510/rng_state.pth +3 -0
  33. checkpoint-510/scheduler.pt +3 -0
  34. checkpoint-510/trainer_state.json +121 -0
  35. checkpoint-510/training_args.bin +3 -0
  36. checkpoint-85/model.safetensors +1 -1
  37. checkpoint-85/optimizer.pt +1 -1
  38. checkpoint-85/scheduler.pt +1 -1
  39. checkpoint-85/trainer_state.json +11 -11
  40. checkpoint-85/training_args.bin +1 -1
  41. model.safetensors +1 -1
  42. runs/Sep06_15-54-59_ubumarcos/events.out.tfevents.1725630900.ubumarcos +3 -0
  43. runs/Sep06_17-32-26_ubumarcos/events.out.tfevents.1725636747.ubumarcos +3 -0
  44. runs/Sep08_21-41-41_ubumarcos/events.out.tfevents.1725824502.ubumarcos +3 -0
  45. runs/Sep08_21-41-41_ubumarcos/events.out.tfevents.1725824503.ubumarcos +3 -0
  46. runs/Sep08_21-42-34_ubumarcos/events.out.tfevents.1725824555.ubumarcos +3 -0
  47. runs/Sep08_21-50-57_ubumarcos/events.out.tfevents.1725825058.ubumarcos +3 -0
  48. training_args.bin +1 -1
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.0359
25
- - Accuracy: 0.9883
26
- - Precision: 0.9883
27
- - Recall: 0.9883
28
- - F1: 0.9883
29
 
30
  ## Model description
31
 
@@ -44,7 +44,7 @@ More information needed
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
- - learning_rate: 5e-05
48
  - train_batch_size: 8
49
  - eval_batch_size: 8
50
  - seed: 123
@@ -53,19 +53,16 @@ The following hyperparameters were used during training:
53
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
  - lr_scheduler_type: cosine
55
  - lr_scheduler_warmup_ratio: 0.001
56
- - num_epochs: 10
57
 
58
  ### Training results
59
 
60
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
61
  |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
62
- | No log | 0.9956 | 85 | 0.1341 | 0.9538 | 0.9543 | 0.9538 | 0.9532 |
63
- | No log | 1.9912 | 170 | 0.0831 | 0.9700 | 0.9720 | 0.9700 | 0.9703 |
64
- | No log | 2.9985 | 256 | 0.0570 | 0.9846 | 0.9847 | 0.9846 | 0.9846 |
65
- | No log | 3.9941 | 341 | 0.0340 | 0.9897 | 0.9898 | 0.9897 | 0.9898 |
66
- | No log | 4.9898 | 426 | 0.0460 | 0.9875 | 0.9879 | 0.9875 | 0.9876 |
67
- | 0.0707 | 5.9971 | 512 | 0.0431 | 0.9897 | 0.9899 | 0.9897 | 0.9898 |
68
- | 0.0707 | 6.9927 | 597 | 0.0359 | 0.9883 | 0.9883 | 0.9883 | 0.9883 |
69
 
70
 
71
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [ntu-spml/distilhubert](https://huggingface.co/ntu-spml/distilhubert) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.0625
25
+ - Accuracy: 0.9824
26
+ - Precision: 0.9825
27
+ - Recall: 0.9824
28
+ - F1: 0.9824
29
 
30
  ## Model description
31
 
 
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
+ - learning_rate: 3e-05
48
  - train_batch_size: 8
49
  - eval_batch_size: 8
50
  - seed: 123
 
53
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
  - lr_scheduler_type: cosine
55
  - lr_scheduler_warmup_ratio: 0.001
56
+ - num_epochs: 4
57
 
58
  ### Training results
59
 
60
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
61
  |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
62
+ | No log | 0.9956 | 85 | 0.1378 | 0.9546 | 0.9543 | 0.9546 | 0.9544 |
63
+ | No log | 1.9912 | 170 | 0.0802 | 0.9714 | 0.9713 | 0.9714 | 0.9714 |
64
+ | No log | 2.9985 | 256 | 0.0682 | 0.9780 | 0.9783 | 0.9780 | 0.9781 |
65
+ | No log | 3.9824 | 340 | 0.0625 | 0.9824 | 0.9825 | 0.9824 | 0.9824 |
 
 
 
66
 
67
 
68
  ### Framework versions
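The hyperparameters listed in the README hunk (learning_rate 3e-05, cosine scheduler, warmup ratio 0.001) together with the max_steps of 340 recorded in the trainer state imply a learning-rate curve that can be sketched in plain Python. This is a minimal sketch of the usual linear-warmup plus half-cosine shape, not the Trainer's own code; the function name `cosine_lr` is hypothetical, and rounding the warmup length up to one step is an assumption.

```python
import math

def cosine_lr(step, base_lr=3e-05, max_steps=340, warmup_ratio=0.001):
    """Learning rate at optimizer step `step` (linear warmup, then half-cosine decay)."""
    # Assumption: warmup length is the ratio times total steps, rounded up (1 step here).
    warmup_steps = math.ceil(max_steps * warmup_ratio)
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the post-warmup schedule that has elapsed, in [0, 1].
    progress = (step - warmup_steps) / max(1, max_steps - warmup_steps)
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))

# Learning rate at the evaluation steps reported in the results table.
for step in (0, 1, 85, 170, 256, 340):
    print(step, f"{cosine_lr(step):.3e}")
```

With so short a warmup, the rate is effectively at its peak from the first update and decays to zero by step 340.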
checkpoint-170/model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:85ed9f40490b0d3b2eab42a74aa78295bc0d5dc610c89bcfd6132ab265cc46c1
3
  size 94763496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8a90553b00c4439fad5a7ed1ff0a9f76e60e707d76e5d455fd77360dff39ae0
3
  size 94763496
checkpoint-170/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:884f223ad6abdd025f9f4a7bfacafc6c26b7792676f01b2a92bf8115fe98f14b
3
  size 189552570
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2dba3cd8b01d3153e53635bf2a218b801d4fb4cfcf7a5f02fc1c0e0d267aeace
3
  size 189552570
checkpoint-170/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9cb57d1cea1cb2499417a821f65ecd6374c5073b4605d8ad26f8414a8ef5b7bd
3
  size 1064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:44565a4b1d8937c55d485ff9cf817bf33c36c26adc047340853e15a70d1a023d
3
  size 1064
checkpoint-170/trainer_state.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "best_metric": 0.96996336996337,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-170",
4
  "epoch": 1.9912152269399708,
5
  "eval_steps": 500,
@@ -10,33 +10,33 @@
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
- "eval_accuracy": 0.9538461538461539,
14
- "eval_f1": 0.953235199825453,
15
- "eval_loss": 0.13410410284996033,
16
- "eval_precision": 0.9543176332312968,
17
- "eval_recall": 0.9538461538461539,
18
- "eval_runtime": 5.9191,
19
- "eval_samples_per_second": 230.61,
20
- "eval_steps_per_second": 28.89,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
- "eval_accuracy": 0.96996336996337,
26
- "eval_f1": 0.970275727358029,
27
- "eval_loss": 0.08308757096529007,
28
- "eval_precision": 0.9720059808561178,
29
- "eval_recall": 0.96996336996337,
30
- "eval_runtime": 5.9321,
31
- "eval_samples_per_second": 230.105,
32
- "eval_steps_per_second": 28.826,
33
  "step": 170
34
  }
35
  ],
36
  "logging_steps": 500,
37
- "max_steps": 850,
38
  "num_input_tokens_seen": 0,
39
- "num_train_epochs": 10,
40
  "save_steps": 500,
41
  "stateful_callbacks": {
42
  "EarlyStoppingCallback": {
 
1
  {
2
+ "best_metric": 0.9714285714285714,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-170",
4
  "epoch": 1.9912152269399708,
5
  "eval_steps": 500,
 
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
+ "eval_accuracy": 0.9545787545787546,
14
+ "eval_f1": 0.9543924849872077,
15
+ "eval_loss": 0.13782845437526703,
16
+ "eval_precision": 0.9543323172404159,
17
+ "eval_recall": 0.9545787545787546,
18
+ "eval_runtime": 6.1437,
19
+ "eval_samples_per_second": 222.18,
20
+ "eval_steps_per_second": 27.834,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
+ "eval_accuracy": 0.9714285714285714,
26
+ "eval_f1": 0.9713707024010776,
27
+ "eval_loss": 0.0802176371216774,
28
+ "eval_precision": 0.9713449610559759,
29
+ "eval_recall": 0.9714285714285714,
30
+ "eval_runtime": 6.0836,
31
+ "eval_samples_per_second": 224.373,
32
+ "eval_steps_per_second": 28.108,
33
  "step": 170
34
  }
35
  ],
36
  "logging_steps": 500,
37
+ "max_steps": 340,
38
  "num_input_tokens_seen": 0,
39
+ "num_train_epochs": 4,
40
  "save_steps": 500,
41
  "stateful_callbacks": {
42
  "EarlyStoppingCallback": {
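The step and epoch values in these trainer states fit a simple relation, sketched below. Two constants are assumptions inferred from the logged numbers rather than read from this diff: 683 batches per epoch and 8 gradient-accumulation steps (for example, 85 * 8 / 683 reproduces the logged epoch 0.9956...). The helper names are hypothetical.

```python
BATCHES_PER_EPOCH = 683   # assumption: inferred from the logged epoch fractions
GRAD_ACCUM_STEPS = 8      # assumption: not visible in this diff

def max_steps(num_epochs):
    """Total optimizer steps when each epoch contributes floor(batches / accumulation)."""
    return (BATCHES_PER_EPOCH // GRAD_ACCUM_STEPS) * num_epochs

def epoch_at(step):
    """Fractional epoch reached after `step` optimizer steps."""
    return step * GRAD_ACCUM_STEPS / BATCHES_PER_EPOCH

def step_at_epoch_end(epoch_index):
    """Optimizer steps completed once `epoch_index` full epochs of batches are consumed."""
    return (epoch_index * BATCHES_PER_EPOCH) // GRAD_ACCUM_STEPS

print(max_steps(4), max_steps(6), max_steps(10))      # compare with the max_steps fields
print([step_at_epoch_end(k) for k in range(1, 8)])    # compare with the checkpoint names
print(epoch_at(340))                                  # compare with checkpoint-340's epoch
```

Under these assumptions the sketch reproduces the max_steps values of 340, 510 and 850 and the checkpoint steps 85, 170, 256, 341, 426, 512 and 597 that appear in this commit.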
checkpoint-170/training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7bd8e997c05f23ae464180f8e5e9f881e9c68322e4fcd3f855e79e2e5d54fbb
3
  size 5240
checkpoint-256/model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b0ef8be12a66054ee49a311e5cbc1e77f0f3bfeb6e0808f2d1f3fb47764355ad
3
  size 94763496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18d5013bf626b187ac7bbd42a7fb1a1b824c821cd062c951de17f19d9fb95944
3
  size 94763496
checkpoint-256/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1fa10a1f73ef67de699f17b8a38795e4419b28e161b6906950ff154e933bfac8
3
  size 189552570
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7187577ead7a6d35f35a5dc44c0b76b32036040837b41821bc67970a2dde3ed2
3
  size 189552570
checkpoint-256/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b581d82b68cb684515c782403bb76f13250579eff8704b811486ec59bb8e6b6a
3
  size 1064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eed9846f2cf239e8810dae264b4e0fe505a115ec40d5b4f9abdb240e9e0a0fae
3
  size 1064
checkpoint-256/trainer_state.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "best_metric": 0.9846153846153847,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-256",
4
  "epoch": 2.998535871156662,
5
  "eval_steps": 500,
@@ -10,45 +10,45 @@
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
- "eval_accuracy": 0.9538461538461539,
14
- "eval_f1": 0.953235199825453,
15
- "eval_loss": 0.13410410284996033,
16
- "eval_precision": 0.9543176332312968,
17
- "eval_recall": 0.9538461538461539,
18
- "eval_runtime": 5.9191,
19
- "eval_samples_per_second": 230.61,
20
- "eval_steps_per_second": 28.89,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
- "eval_accuracy": 0.96996336996337,
26
- "eval_f1": 0.970275727358029,
27
- "eval_loss": 0.08308757096529007,
28
- "eval_precision": 0.9720059808561178,
29
- "eval_recall": 0.96996336996337,
30
- "eval_runtime": 5.9321,
31
- "eval_samples_per_second": 230.105,
32
- "eval_steps_per_second": 28.826,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
- "eval_accuracy": 0.9846153846153847,
38
- "eval_f1": 0.9845587007434289,
39
- "eval_loss": 0.05695493519306183,
40
- "eval_precision": 0.984652882646444,
41
- "eval_recall": 0.9846153846153847,
42
- "eval_runtime": 5.924,
43
- "eval_samples_per_second": 230.421,
44
- "eval_steps_per_second": 28.866,
45
  "step": 256
46
  }
47
  ],
48
  "logging_steps": 500,
49
- "max_steps": 850,
50
  "num_input_tokens_seen": 0,
51
- "num_train_epochs": 10,
52
  "save_steps": 500,
53
  "stateful_callbacks": {
54
  "EarlyStoppingCallback": {
 
1
  {
2
+ "best_metric": 0.978021978021978,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-256",
4
  "epoch": 2.998535871156662,
5
  "eval_steps": 500,
 
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
+ "eval_accuracy": 0.9545787545787546,
14
+ "eval_f1": 0.9543924849872077,
15
+ "eval_loss": 0.13782845437526703,
16
+ "eval_precision": 0.9543323172404159,
17
+ "eval_recall": 0.9545787545787546,
18
+ "eval_runtime": 6.1437,
19
+ "eval_samples_per_second": 222.18,
20
+ "eval_steps_per_second": 27.834,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
+ "eval_accuracy": 0.9714285714285714,
26
+ "eval_f1": 0.9713707024010776,
27
+ "eval_loss": 0.0802176371216774,
28
+ "eval_precision": 0.9713449610559759,
29
+ "eval_recall": 0.9714285714285714,
30
+ "eval_runtime": 6.0836,
31
+ "eval_samples_per_second": 224.373,
32
+ "eval_steps_per_second": 28.108,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
+ "eval_accuracy": 0.978021978021978,
38
+ "eval_f1": 0.9781077738703647,
39
+ "eval_loss": 0.06824816763401031,
40
+ "eval_precision": 0.9783204609221348,
41
+ "eval_recall": 0.978021978021978,
42
+ "eval_runtime": 6.0791,
43
+ "eval_samples_per_second": 224.54,
44
+ "eval_steps_per_second": 28.129,
45
  "step": 256
46
  }
47
  ],
48
  "logging_steps": 500,
49
+ "max_steps": 340,
50
  "num_input_tokens_seen": 0,
51
+ "num_train_epochs": 4,
52
  "save_steps": 500,
53
  "stateful_callbacks": {
54
  "EarlyStoppingCallback": {
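The logged evaluation fields can be cross-checked against each other. As a rough sketch, multiplying eval_runtime by eval_samples_per_second estimates the evaluation set size, and multiplying that by eval_accuracy gives the number of correct predictions. The helper names are hypothetical, and treating the rounded product as the exact sample count is an assumption.

```python
def eval_set_size(runtime_s, samples_per_second):
    """Estimate the number of evaluation samples from the logged throughput."""
    return round(runtime_s * samples_per_second)

def correct_predictions(accuracy, n_samples):
    """Number of correctly classified samples implied by an accuracy value."""
    return round(accuracy * n_samples)

# Values logged at step 85 of the new run.
n = eval_set_size(6.1437, 222.18)
for step, acc in [(85, 0.9545787545787546),
                  (170, 0.9714285714285714),
                  (256, 0.978021978021978)]:
    right = correct_predictions(acc, n)
    print(f"step {step}: {right}/{n} correct, {n - right} errors")
```

With an estimated 1365 evaluation samples, each misclassified sample moves accuracy by about 0.0007, which helps put the differences between checkpoints in perspective.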
checkpoint-256/training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7bd8e997c05f23ae464180f8e5e9f881e9c68322e4fcd3f855e79e2e5d54fbb
3
  size 5240
checkpoint-340/config.json ADDED
@@ -0,0 +1,81 @@
1
+ {
2
+ "_name_or_path": "ntu-spml/distilhubert",
3
+ "activation_dropout": 0.1,
4
+ "apply_spec_augment": false,
5
+ "architectures": [
6
+ "HubertForSequenceClassification"
7
+ ],
8
+ "attention_dropout": 0.1,
9
+ "bos_token_id": 1,
10
+ "classifier_proj_size": 256,
11
+ "conv_bias": false,
12
+ "conv_dim": [
13
+ 512,
14
+ 512,
15
+ 512,
16
+ 512,
17
+ 512,
18
+ 512,
19
+ 512
20
+ ],
21
+ "conv_kernel": [
22
+ 10,
23
+ 3,
24
+ 3,
25
+ 3,
26
+ 3,
27
+ 2,
28
+ 2
29
+ ],
30
+ "conv_stride": [
31
+ 5,
32
+ 2,
33
+ 2,
34
+ 2,
35
+ 2,
36
+ 2,
37
+ 2
38
+ ],
39
+ "ctc_loss_reduction": "sum",
40
+ "ctc_zero_infinity": false,
41
+ "do_stable_layer_norm": false,
42
+ "eos_token_id": 2,
43
+ "feat_extract_activation": "gelu",
44
+ "feat_extract_norm": "group",
45
+ "feat_proj_dropout": 0.0,
46
+ "feat_proj_layer_norm": false,
47
+ "final_dropout": 0.0,
48
+ "finetuning_task": "audio-classification",
49
+ "hidden_act": "gelu",
50
+ "hidden_dropout": 0.1,
51
+ "hidden_size": 768,
52
+ "id2label": {
53
+ "0": "crying",
54
+ "1": "no_crying"
55
+ },
56
+ "initializer_range": 0.02,
57
+ "intermediate_size": 3072,
58
+ "label2id": {
59
+ "crying": 0,
60
+ "no_crying": 1
61
+ },
62
+ "layer_norm_eps": 1e-05,
63
+ "layerdrop": 0.0,
64
+ "mask_feature_length": 10,
65
+ "mask_feature_min_masks": 0,
66
+ "mask_feature_prob": 0.0,
67
+ "mask_time_length": 10,
68
+ "mask_time_min_masks": 2,
69
+ "mask_time_prob": 0.05,
70
+ "model_type": "hubert",
71
+ "num_attention_heads": 12,
72
+ "num_conv_pos_embedding_groups": 16,
73
+ "num_conv_pos_embeddings": 128,
74
+ "num_feat_extract_layers": 7,
75
+ "num_hidden_layers": 2,
76
+ "pad_token_id": 0,
77
+ "torch_dtype": "float32",
78
+ "transformers_version": "4.44.2",
79
+ "use_weighted_layer_sum": false,
80
+ "vocab_size": 32
81
+ }
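The conv_kernel and conv_stride lists in this config determine how many feature frames the convolutional encoder produces for a given number of input samples. The sketch below applies the standard output-length formula for an unpadded 1-D convolution layer by layer. The absence of padding and the 16 kHz sampling rate used in the example are assumptions, since neither is stated in this config, and `num_frames` is a hypothetical helper.

```python
import math

CONV_KERNEL = [10, 3, 3, 3, 3, 2, 2]   # from "conv_kernel" in config.json
CONV_STRIDE = [5, 2, 2, 2, 2, 2, 2]    # from "conv_stride" in config.json

def num_frames(num_samples):
    """Frames left after the seven conv layers, assuming no padding."""
    length = num_samples
    for kernel, stride in zip(CONV_KERNEL, CONV_STRIDE):
        length = (length - kernel) // stride + 1
    return length

total_stride = math.prod(CONV_STRIDE)   # input samples advanced per output frame
print(total_stride)                     # 320
print(num_frames(16000))                # 49 frames for one second at an assumed 16 kHz
```

At an assumed 16 kHz, a total stride of 320 samples corresponds to one frame every 20 ms.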
checkpoint-340/model.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:02108e5b5b6258f37f5cf8b56d34f77aa8b483e760491525d71597e196663a68
3
+ size 94763496
checkpoint-340/optimizer.pt ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61952855204c730d99821b044d03e59d5a50dc455c9713aebc545cf6b998fc2c
3
+ size 189552570
checkpoint-340/rng_state.pth ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7dd0ffd727edc5638a7e778a739fb7230d6944f557af90208a783e44c06584b3
3
+ size 14308
checkpoint-340/scheduler.pt ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56113043840d93f5e84e0b3a8dd3434ecf0610eae88f4dc19d9d01634ed9afe0
3
+ size 1064
checkpoint-340/trainer_state.json ADDED
@@ -0,0 +1,90 @@
1
+ {
2
+ "best_metric": 0.9824175824175824,
3
+ "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-340",
4
+ "epoch": 3.9824304538799415,
5
+ "eval_steps": 500,
6
+ "global_step": 340,
7
+ "is_hyper_param_search": false,
8
+ "is_local_process_zero": true,
9
+ "is_world_process_zero": true,
10
+ "log_history": [
11
+ {
12
+ "epoch": 0.9956076134699854,
13
+ "eval_accuracy": 0.9545787545787546,
14
+ "eval_f1": 0.9543924849872077,
15
+ "eval_loss": 0.13782845437526703,
16
+ "eval_precision": 0.9543323172404159,
17
+ "eval_recall": 0.9545787545787546,
18
+ "eval_runtime": 6.1437,
19
+ "eval_samples_per_second": 222.18,
20
+ "eval_steps_per_second": 27.834,
21
+ "step": 85
22
+ },
23
+ {
24
+ "epoch": 1.9912152269399708,
25
+ "eval_accuracy": 0.9714285714285714,
26
+ "eval_f1": 0.9713707024010776,
27
+ "eval_loss": 0.0802176371216774,
28
+ "eval_precision": 0.9713449610559759,
29
+ "eval_recall": 0.9714285714285714,
30
+ "eval_runtime": 6.0836,
31
+ "eval_samples_per_second": 224.373,
32
+ "eval_steps_per_second": 28.108,
33
+ "step": 170
34
+ },
35
+ {
36
+ "epoch": 2.998535871156662,
37
+ "eval_accuracy": 0.978021978021978,
38
+ "eval_f1": 0.9781077738703647,
39
+ "eval_loss": 0.06824816763401031,
40
+ "eval_precision": 0.9783204609221348,
41
+ "eval_recall": 0.978021978021978,
42
+ "eval_runtime": 6.0791,
43
+ "eval_samples_per_second": 224.54,
44
+ "eval_steps_per_second": 28.129,
45
+ "step": 256
46
+ },
47
+ {
48
+ "epoch": 3.9824304538799415,
49
+ "eval_accuracy": 0.9824175824175824,
50
+ "eval_f1": 0.9824454470096671,
51
+ "eval_loss": 0.06251030415296555,
52
+ "eval_precision": 0.9824939338474881,
53
+ "eval_recall": 0.9824175824175824,
54
+ "eval_runtime": 5.945,
55
+ "eval_samples_per_second": 229.604,
56
+ "eval_steps_per_second": 28.764,
57
+ "step": 340
58
+ }
59
+ ],
60
+ "logging_steps": 500,
61
+ "max_steps": 340,
62
+ "num_input_tokens_seen": 0,
63
+ "num_train_epochs": 4,
64
+ "save_steps": 500,
65
+ "stateful_callbacks": {
66
+ "EarlyStoppingCallback": {
67
+ "args": {
68
+ "early_stopping_patience": 3,
69
+ "early_stopping_threshold": 0.0
70
+ },
71
+ "attributes": {
72
+ "early_stopping_patience_counter": 0
73
+ }
74
+ },
75
+ "TrainerControl": {
76
+ "args": {
77
+ "should_epoch_stop": false,
78
+ "should_evaluate": false,
79
+ "should_log": false,
80
+ "should_save": true,
81
+ "should_training_stop": true
82
+ },
83
+ "attributes": {}
84
+ }
85
+ },
86
+ "total_flos": 4.9458806093568e+16,
87
+ "train_batch_size": 8,
88
+ "trial_name": null,
89
+ "trial_params": null
90
+ }
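The EarlyStoppingCallback entry above records a patience of 3, a threshold of 0.0 and a counter of 0. A simplified sketch of how such a patience counter behaves is shown below. It is not the library's implementation: `run_early_stopping` is a hypothetical helper, and it assumes the monitored metric is accuracy with higher values being better. The second list uses the four-decimal accuracies from the previous README table, so the tie at 0.9897 reflects rounding rather than the exact logged values.

```python
def run_early_stopping(metrics, patience=3, threshold=0.0):
    """Return (index of the evaluation that triggers a stop or None, best metric, counter)."""
    best = None
    counter = 0
    for i, value in enumerate(metrics):
        improved = best is None or (value > best and abs(value - best) > threshold)
        if improved:
            best = value
            counter = 0          # any improvement resets the patience counter
        else:
            counter += 1         # no improvement at this evaluation
        if counter >= patience:
            return i, best, counter
    return None, best, counter

# New run (4 epochs): accuracy improves at every evaluation, so the counter stays at 0.
print(run_early_stopping([0.9546, 0.9714, 0.9780, 0.9824]))

# Previous run (rounded README values): three evaluations without improvement after 0.9897.
print(run_early_stopping([0.9538, 0.9700, 0.9846, 0.9897, 0.9875, 0.9897, 0.9883]))
```

Under these assumptions the previous run stops at its seventh evaluation, which matches the last row (step 597) of the removed results table, while the new run ends because it reaches max_steps.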
checkpoint-340/training_args.bin ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7bd8e997c05f23ae464180f8e5e9f881e9c68322e4fcd3f855e79e2e5d54fbb
3
+ size 5240
checkpoint-341/model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:937ffa1cb79f16f4e7753ea842c3e32fe34e525562027e5b33ce48fc4b308a82
3
  size 94763496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4aef887b699e0e2fffd8fb2dabc3a3ee00a823581110ae055322fc7dccddae5f
3
  size 94763496
checkpoint-341/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2a6119421582bbc05f8c02a4a9aab4dfdd2cb761cc17d9b827d99f803877b837
3
  size 189552570
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1f9a8acfee398336615756edc66f296d1f801d2fde4edac691df5318d5bad0e5
3
  size 189552570
checkpoint-341/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6383f793cae08448b1e6023eca64340792bd03b29faa8b0a3800859cbaf6c3ca
3
  size 1064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c8d108d39d63bcbfcec1770d1c6b68f7c54e97b38a7c6c27d715b349ae526e1
3
  size 1064
checkpoint-341/trainer_state.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "best_metric": 0.9897435897435898,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-341",
4
  "epoch": 3.994143484626647,
5
  "eval_steps": 500,
@@ -10,50 +10,50 @@
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
- "eval_accuracy": 0.9538461538461539,
14
- "eval_f1": 0.953235199825453,
15
- "eval_loss": 0.13410410284996033,
16
- "eval_precision": 0.9543176332312968,
17
- "eval_recall": 0.9538461538461539,
18
- "eval_runtime": 5.9191,
19
- "eval_samples_per_second": 230.61,
20
- "eval_steps_per_second": 28.89,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
- "eval_accuracy": 0.96996336996337,
26
- "eval_f1": 0.970275727358029,
27
- "eval_loss": 0.08308757096529007,
28
- "eval_precision": 0.9720059808561178,
29
- "eval_recall": 0.96996336996337,
30
- "eval_runtime": 5.9321,
31
- "eval_samples_per_second": 230.105,
32
- "eval_steps_per_second": 28.826,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
- "eval_accuracy": 0.9846153846153847,
38
- "eval_f1": 0.9845587007434289,
39
- "eval_loss": 0.05695493519306183,
40
- "eval_precision": 0.984652882646444,
41
- "eval_recall": 0.9846153846153847,
42
- "eval_runtime": 5.924,
43
- "eval_samples_per_second": 230.421,
44
- "eval_steps_per_second": 28.866,
45
  "step": 256
46
  },
47
  {
48
  "epoch": 3.994143484626647,
49
- "eval_accuracy": 0.9897435897435898,
50
- "eval_f1": 0.9897502348272303,
51
- "eval_loss": 0.03400224819779396,
52
- "eval_precision": 0.9897618176566864,
53
- "eval_recall": 0.9897435897435898,
54
- "eval_runtime": 5.9234,
55
- "eval_samples_per_second": 230.441,
56
- "eval_steps_per_second": 28.868,
57
  "step": 341
58
  }
59
  ],
 
1
  {
2
+ "best_metric": 0.9875457875457876,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-341",
4
  "epoch": 3.994143484626647,
5
  "eval_steps": 500,
 
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
+ "eval_accuracy": 0.9663003663003663,
14
+ "eval_f1": 0.9661425019393345,
15
+ "eval_loss": 0.10625720024108887,
16
+ "eval_precision": 0.966167198143384,
17
+ "eval_recall": 0.9663003663003663,
18
+ "eval_runtime": 6.2148,
19
+ "eval_samples_per_second": 219.638,
20
+ "eval_steps_per_second": 27.515,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
+ "eval_accuracy": 0.9824175824175824,
26
+ "eval_f1": 0.9824569651195038,
27
+ "eval_loss": 0.06455090641975403,
28
+ "eval_precision": 0.9825418778671476,
29
+ "eval_recall": 0.9824175824175824,
30
+ "eval_runtime": 5.7635,
31
+ "eval_samples_per_second": 236.835,
32
+ "eval_steps_per_second": 29.669,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
+ "eval_accuracy": 0.9868131868131869,
38
+ "eval_f1": 0.9868427238396278,
39
+ "eval_loss": 0.04568345472216606,
40
+ "eval_precision": 0.98691808355097,
41
+ "eval_recall": 0.9868131868131869,
42
+ "eval_runtime": 6.1234,
43
+ "eval_samples_per_second": 222.916,
44
+ "eval_steps_per_second": 27.926,
45
  "step": 256
46
  },
47
  {
48
  "epoch": 3.994143484626647,
49
+ "eval_accuracy": 0.9875457875457876,
50
+ "eval_f1": 0.9875962963533335,
51
+ "eval_loss": 0.049539219588041306,
52
+ "eval_precision": 0.9877998892958556,
53
+ "eval_recall": 0.9875457875457876,
54
+ "eval_runtime": 6.1353,
55
+ "eval_samples_per_second": 222.481,
56
+ "eval_steps_per_second": 27.871,
57
  "step": 341
58
  }
59
  ],
checkpoint-341/training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:325406b343874b92345e9a863970ee7cd441131f2850f47ab026ed810d19a88c
3
  size 5240
checkpoint-426/model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2f2381bd8ac8db8cd3c2425166e4419c44d01eb2bdc4a00d95e3ff3806ebad52
3
  size 94763496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ccd0853b18fcef29f6c50370c40938c3fc157e6d6ab4f5a8cc0ee4de4c2a330
3
  size 94763496
checkpoint-426/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ddad96bd00a38ac13e824dddeb8bec0bff93f4b8005f4f483d201e92de9bdadf
3
  size 189552570
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58cf1896c9f8575d9e790921f864e581859c2dd6014460c20f2981f52787a4a4
3
  size 189552570
checkpoint-426/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5f3d79bed0dd03dfef1d407d59369329e4670d54e356165ddfc162563cf81d14
3
  size 1064
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:77da8d31fe5105cdaffd1de28e960ad7041d43ef167667465d3bd1e966ae27b0
3
  size 1064
checkpoint-426/trainer_state.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "best_metric": 0.9897435897435898,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-341",
4
  "epoch": 4.989751098096632,
5
  "eval_steps": 500,
@@ -10,69 +10,69 @@
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
- "eval_accuracy": 0.9538461538461539,
14
- "eval_f1": 0.953235199825453,
15
- "eval_loss": 0.13410410284996033,
16
- "eval_precision": 0.9543176332312968,
17
- "eval_recall": 0.9538461538461539,
18
- "eval_runtime": 5.9191,
19
- "eval_samples_per_second": 230.61,
20
- "eval_steps_per_second": 28.89,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
- "eval_accuracy": 0.96996336996337,
26
- "eval_f1": 0.970275727358029,
27
- "eval_loss": 0.08308757096529007,
28
- "eval_precision": 0.9720059808561178,
29
- "eval_recall": 0.96996336996337,
30
- "eval_runtime": 5.9321,
31
- "eval_samples_per_second": 230.105,
32
- "eval_steps_per_second": 28.826,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
- "eval_accuracy": 0.9846153846153847,
38
- "eval_f1": 0.9845587007434289,
39
- "eval_loss": 0.05695493519306183,
40
- "eval_precision": 0.984652882646444,
41
- "eval_recall": 0.9846153846153847,
42
- "eval_runtime": 5.924,
43
- "eval_samples_per_second": 230.421,
44
- "eval_steps_per_second": 28.866,
45
  "step": 256
46
  },
47
  {
48
  "epoch": 3.994143484626647,
49
- "eval_accuracy": 0.9897435897435898,
50
- "eval_f1": 0.9897502348272303,
51
- "eval_loss": 0.03400224819779396,
52
- "eval_precision": 0.9897618176566864,
53
- "eval_recall": 0.9897435897435898,
54
- "eval_runtime": 5.9234,
55
- "eval_samples_per_second": 230.441,
56
- "eval_steps_per_second": 28.868,
57
  "step": 341
58
  },
59
  {
60
  "epoch": 4.989751098096632,
61
- "eval_accuracy": 0.9875457875457876,
62
- "eval_f1": 0.9876042969414931,
63
- "eval_loss": 0.04599935933947563,
64
- "eval_precision": 0.9879353582239604,
65
- "eval_recall": 0.9875457875457876,
66
- "eval_runtime": 5.9141,
67
- "eval_samples_per_second": 230.803,
68
- "eval_steps_per_second": 28.914,
69
  "step": 426
70
  }
71
  ],
72
  "logging_steps": 500,
73
- "max_steps": 850,
74
  "num_input_tokens_seen": 0,
75
- "num_train_epochs": 10,
76
  "save_steps": 500,
77
  "stateful_callbacks": {
78
  "EarlyStoppingCallback": {
 
1
  {
2
+ "best_metric": 0.991941391941392,
3
  "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-341",
4
  "epoch": 4.989751098096632,
5
  "eval_steps": 500,
 
10
  "log_history": [
11
  {
12
  "epoch": 0.9956076134699854,
13
+ "eval_accuracy": 0.9597069597069597,
14
+ "eval_f1": 0.9599836188656065,
15
+ "eval_loss": 0.12706419825553894,
16
+ "eval_precision": 0.960694098126733,
17
+ "eval_recall": 0.9597069597069597,
18
+ "eval_runtime": 6.1839,
19
+ "eval_samples_per_second": 220.733,
20
+ "eval_steps_per_second": 27.652,
21
  "step": 85
22
  },
23
  {
24
  "epoch": 1.9912152269399708,
25
+ "eval_accuracy": 0.9787545787545787,
26
+ "eval_f1": 0.9788099592207686,
27
+ "eval_loss": 0.07009062170982361,
28
+ "eval_precision": 0.9789268917610887,
29
+ "eval_recall": 0.9787545787545787,
30
+ "eval_runtime": 6.2624,
31
+ "eval_samples_per_second": 217.969,
32
+ "eval_steps_per_second": 27.306,
33
  "step": 170
34
  },
35
  {
36
  "epoch": 2.998535871156662,
37
+ "eval_accuracy": 0.9831501831501831,
38
+ "eval_f1": 0.9831941055888854,
39
+ "eval_loss": 0.06720960885286331,
40
+ "eval_precision": 0.9832999772248476,
41
+ "eval_recall": 0.9831501831501831,
42
+ "eval_runtime": 5.9845,
43
+ "eval_samples_per_second": 228.09,
44
+ "eval_steps_per_second": 28.574,
45
  "step": 256
46
  },
47
  {
48
  "epoch": 3.994143484626647,
49
+ "eval_accuracy": 0.991941391941392,
50
+ "eval_f1": 0.9919444380146174,
51
+ "eval_loss": 0.04145079106092453,
52
+ "eval_precision": 0.9919487758943655,
53
+ "eval_recall": 0.991941391941392,
54
+ "eval_runtime": 6.1222,
55
+ "eval_samples_per_second": 222.958,
56
+ "eval_steps_per_second": 27.931,
57
  "step": 341
58
  },
59
  {
60
  "epoch": 4.989751098096632,
61
+ "eval_accuracy": 0.991941391941392,
62
+ "eval_f1": 0.9919444380146174,
63
+ "eval_loss": 0.037522196769714355,
64
+ "eval_precision": 0.9919487758943655,
65
+ "eval_recall": 0.991941391941392,
66
+ "eval_runtime": 6.1601,
67
+ "eval_samples_per_second": 221.586,
68
+ "eval_steps_per_second": 27.759,
69
  "step": 426
70
  }
71
  ],
72
  "logging_steps": 500,
73
+ "max_steps": 510,
74
  "num_input_tokens_seen": 0,
75
+ "num_train_epochs": 6,
76
  "save_steps": 500,
77
  "stateful_callbacks": {
78
  "EarlyStoppingCallback": {
checkpoint-426/training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0f29e363429f3041b65000237638d975c16df70ba592fd3c2ce610392338bd3
3
  size 5240
checkpoint-510/config.json ADDED
@@ -0,0 +1,81 @@
1
+ {
2
+ "_name_or_path": "ntu-spml/distilhubert",
3
+ "activation_dropout": 0.1,
4
+ "apply_spec_augment": false,
5
+ "architectures": [
6
+ "HubertForSequenceClassification"
7
+ ],
8
+ "attention_dropout": 0.1,
9
+ "bos_token_id": 1,
10
+ "classifier_proj_size": 256,
11
+ "conv_bias": false,
12
+ "conv_dim": [
13
+ 512,
14
+ 512,
15
+ 512,
16
+ 512,
17
+ 512,
18
+ 512,
19
+ 512
20
+ ],
21
+ "conv_kernel": [
22
+ 10,
23
+ 3,
24
+ 3,
25
+ 3,
26
+ 3,
27
+ 2,
28
+ 2
29
+ ],
30
+ "conv_stride": [
31
+ 5,
32
+ 2,
33
+ 2,
34
+ 2,
35
+ 2,
36
+ 2,
37
+ 2
38
+ ],
39
+ "ctc_loss_reduction": "sum",
40
+ "ctc_zero_infinity": false,
41
+ "do_stable_layer_norm": false,
42
+ "eos_token_id": 2,
43
+ "feat_extract_activation": "gelu",
44
+ "feat_extract_norm": "group",
45
+ "feat_proj_dropout": 0.0,
46
+ "feat_proj_layer_norm": false,
47
+ "final_dropout": 0.0,
48
+ "finetuning_task": "audio-classification",
49
+ "hidden_act": "gelu",
50
+ "hidden_dropout": 0.1,
51
+ "hidden_size": 768,
52
+ "id2label": {
53
+ "0": "crying",
54
+ "1": "no_crying"
55
+ },
56
+ "initializer_range": 0.02,
57
+ "intermediate_size": 3072,
58
+ "label2id": {
59
+ "crying": 0,
60
+ "no_crying": 1
61
+ },
62
+ "layer_norm_eps": 1e-05,
63
+ "layerdrop": 0.0,
64
+ "mask_feature_length": 10,
65
+ "mask_feature_min_masks": 0,
66
+ "mask_feature_prob": 0.0,
67
+ "mask_time_length": 10,
68
+ "mask_time_min_masks": 2,
69
+ "mask_time_prob": 0.05,
70
+ "model_type": "hubert",
71
+ "num_attention_heads": 12,
72
+ "num_conv_pos_embedding_groups": 16,
73
+ "num_conv_pos_embeddings": 128,
74
+ "num_feat_extract_layers": 7,
75
+ "num_hidden_layers": 2,
76
+ "pad_token_id": 0,
77
+ "torch_dtype": "float32",
78
+ "transformers_version": "4.44.2",
79
+ "use_weighted_layer_sum": false,
80
+ "vocab_size": 32
81
+ }
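The `conv_kernel` and `conv_stride` arrays in this config determine how many feature frames the convolutional front end produces per second of audio. The sketch below works that out in plain Python. It assumes the seven convolutions are unpadded and applied in order, and it assumes a 16 kHz sample rate, which is not stated in `config.json`; the function names are illustrative, not part of any library.

```python
# Values copied from the conv_kernel / conv_stride arrays in config.json.
CONV_KERNEL = [10, 3, 3, 3, 3, 2, 2]
CONV_STRIDE = [5, 2, 2, 2, 2, 2, 2]

# Assumption: 16 kHz input audio (the config itself does not record this).
SAMPLE_RATE = 16_000


def total_stride(strides):
    """Hop size in input samples between consecutive output frames."""
    hop = 1
    for s in strides:
        hop *= s
    return hop


def receptive_field(kernels, strides):
    """Number of input samples that influence a single output frame."""
    field, jump = 1, 1
    for k, s in zip(kernels, strides):
        field += (k - 1) * jump  # each layer widens the window by (k - 1) jumps
        jump *= s                # distance between adjacent inputs of the next layer
    return field


def num_frames(num_samples, kernels, strides):
    """Output length of the conv stack, assuming no padding."""
    length = num_samples
    for k, s in zip(kernels, strides):
        length = (length - k) // s + 1
    return length


hop = total_stride(CONV_STRIDE)
window = receptive_field(CONV_KERNEL, CONV_STRIDE)
frames_per_second = num_frames(SAMPLE_RATE, CONV_KERNEL, CONV_STRIDE)
print(hop, window, frames_per_second)
```

Under these assumptions each frame covers 400 samples with a hop of 320 samples, so at 16 kHz that is a 25 ms window every 20 ms.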
checkpoint-510/model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4372169de69c47d33397222523009d2fdeed9d3854c7375e8590a1241949291d
+ size 94763496
checkpoint-510/optimizer.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:294fd84ccb3eb4e7c39baaa641e3c8f3ea74bdf3b1a46a34208e1257b56d0054
+ size 189552570
checkpoint-510/rng_state.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9cc7449d67ace31a2b3ec5d56092754753743418fdab51384b92915bdf832f21
+ size 14308
checkpoint-510/scheduler.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c10e0f825c3d4b6dffd76325c743990d99721ef8a75e18bb498ded17f46b6918
+ size 1064
checkpoint-510/trainer_state.json ADDED
@@ -0,0 +1,121 @@
+ {
+   "best_metric": 0.991941391941392,
+   "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-341",
+   "epoch": 5.973645680819912,
+   "eval_steps": 500,
+   "global_step": 510,
+   "is_hyper_param_search": false,
+   "is_local_process_zero": true,
+   "is_world_process_zero": true,
+   "log_history": [
+     {
+       "epoch": 0.9956076134699854,
+       "eval_accuracy": 0.9597069597069597,
+       "eval_f1": 0.9599836188656065,
+       "eval_loss": 0.12706419825553894,
+       "eval_precision": 0.960694098126733,
+       "eval_recall": 0.9597069597069597,
+       "eval_runtime": 6.1839,
+       "eval_samples_per_second": 220.733,
+       "eval_steps_per_second": 27.652,
+       "step": 85
+     },
+     {
+       "epoch": 1.9912152269399708,
+       "eval_accuracy": 0.9787545787545787,
+       "eval_f1": 0.9788099592207686,
+       "eval_loss": 0.07009062170982361,
+       "eval_precision": 0.9789268917610887,
+       "eval_recall": 0.9787545787545787,
+       "eval_runtime": 6.2624,
+       "eval_samples_per_second": 217.969,
+       "eval_steps_per_second": 27.306,
+       "step": 170
+     },
+     {
+       "epoch": 2.998535871156662,
+       "eval_accuracy": 0.9831501831501831,
+       "eval_f1": 0.9831941055888854,
+       "eval_loss": 0.06720960885286331,
+       "eval_precision": 0.9832999772248476,
+       "eval_recall": 0.9831501831501831,
+       "eval_runtime": 5.9845,
+       "eval_samples_per_second": 228.09,
+       "eval_steps_per_second": 28.574,
+       "step": 256
+     },
+     {
+       "epoch": 3.994143484626647,
+       "eval_accuracy": 0.991941391941392,
+       "eval_f1": 0.9919444380146174,
+       "eval_loss": 0.04145079106092453,
+       "eval_precision": 0.9919487758943655,
+       "eval_recall": 0.991941391941392,
+       "eval_runtime": 6.1222,
+       "eval_samples_per_second": 222.958,
+       "eval_steps_per_second": 27.931,
+       "step": 341
+     },
+     {
+       "epoch": 4.989751098096632,
+       "eval_accuracy": 0.991941391941392,
+       "eval_f1": 0.9919444380146174,
+       "eval_loss": 0.037522196769714355,
+       "eval_precision": 0.9919487758943655,
+       "eval_recall": 0.991941391941392,
+       "eval_runtime": 6.1601,
+       "eval_samples_per_second": 221.586,
+       "eval_steps_per_second": 27.759,
+       "step": 426
+     },
+     {
+       "epoch": 5.856515373352855,
+       "grad_norm": 0.05031043663620949,
+       "learning_rate": 9.520643773712201e-08,
+       "loss": 0.056,
+       "step": 500
+     },
+     {
+       "epoch": 5.973645680819912,
+       "eval_accuracy": 0.991941391941392,
+       "eval_f1": 0.9919444380146174,
+       "eval_loss": 0.037607260048389435,
+       "eval_precision": 0.9919487758943655,
+       "eval_recall": 0.991941391941392,
+       "eval_runtime": 6.0461,
+       "eval_samples_per_second": 225.765,
+       "eval_steps_per_second": 28.283,
+       "step": 510
+     }
+   ],
+   "logging_steps": 500,
+   "max_steps": 510,
+   "num_input_tokens_seen": 0,
+   "num_train_epochs": 6,
+   "save_steps": 500,
+   "stateful_callbacks": {
+     "EarlyStoppingCallback": {
+       "args": {
+         "early_stopping_patience": 3,
+         "early_stopping_threshold": 0.0
+       },
+       "attributes": {
+         "early_stopping_patience_counter": 0
+       }
+     },
+     "TrainerControl": {
+       "args": {
+         "should_epoch_stop": false,
+         "should_evaluate": false,
+         "should_log": false,
+         "should_save": true,
+         "should_training_stop": true
+       },
+       "attributes": {}
+     }
+   },
+   "total_flos": 7.418366078592e+16,
+   "train_batch_size": 8,
+   "trial_name": null,
+   "trial_params": null
+ }
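The `log_history` array mixes evaluation entries with one training-loss entry (step 500), and `best_model_checkpoint` points at step 341 even though steps 426 and 510 report the same accuracy. The sketch below reproduces that selection from a subset of the logged values. It assumes the selection metric is `eval_accuracy` (the stored `best_metric` matches it) and that a later checkpoint replaces the best one only on a strict improvement; `best_eval_entry` is an illustrative helper, not a `transformers` function.

```python
# Subset of log_history from checkpoint-510/trainer_state.json.
log_history = [
    {"step": 85, "eval_accuracy": 0.9597069597069597, "eval_loss": 0.12706419825553894},
    {"step": 170, "eval_accuracy": 0.9787545787545787, "eval_loss": 0.07009062170982361},
    {"step": 256, "eval_accuracy": 0.9831501831501831, "eval_loss": 0.06720960885286331},
    {"step": 341, "eval_accuracy": 0.991941391941392, "eval_loss": 0.04145079106092453},
    {"step": 426, "eval_accuracy": 0.991941391941392, "eval_loss": 0.037522196769714355},
    {"step": 500, "loss": 0.056},  # training-loss entry, carries no eval metrics
    {"step": 510, "eval_accuracy": 0.991941391941392, "eval_loss": 0.037607260048389435},
]


def best_eval_entry(history, metric="eval_accuracy"):
    """Return the first entry with the highest value of `metric`.

    Entries without the metric (training-loss logs) are skipped. Ties keep
    the earlier entry because only a strictly greater value replaces it.
    """
    evals = [entry for entry in history if metric in entry]
    best = evals[0]
    for entry in evals[1:]:
        if entry[metric] > best[metric]:
            best = entry
    return best


best = best_eval_entry(log_history)
lowest_loss = min(
    (entry for entry in log_history if "eval_loss" in entry),
    key=lambda entry: entry["eval_loss"],
)
print(best["step"], lowest_loss["step"])
```

Selecting on `eval_loss` instead would favor step 426, so the choice of metric changes which checkpoint counts as best for this run.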
checkpoint-510/training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a0f29e363429f3041b65000237638d975c16df70ba592fd3c2ce610392338bd3
+ size 5240
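Every binary file in this commit appears as a three-line Git LFS pointer (`version`, `oid`, `size`) rather than as its contents. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, and it assumes each line is a key, a single space, and a value, as in the pointers shown in this diff.

```python
# Pointer text as shown for checkpoint-510/training_args.bin.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a0f29e363429f3041b65000237638d975c16df70ba592fd3c2ce610392338bd3
size 5240
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size"])
```

Comparing digests is a cheap way to spot identical payloads: the `oid` here equals the new `oid` of `checkpoint-426/training_args.bin`, which suggests both checkpoints store the same training arguments.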
checkpoint-85/model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:735ade2638b1f118fe459d58e0f846d8676adb708417e44d45e5c5631286bd6c
+ oid sha256:667f991dfd621ded0f030ae62bee9e55c4e297c107c7ab609989f56ba7da4287
  size 94763496
checkpoint-85/optimizer.pt CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f9c2df0704bbd0ae60f95b2890a8b3ca3655b01d98a8e15e46e75396add56c28
+ oid sha256:2b7d98f10852c7c698f2667e9c22700b698aaca9caf2c11ddb129f3cb85eeb89
  size 189552570
checkpoint-85/scheduler.pt CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:627e28ee33138f33050cb3ea94003427c4a362d22084d2243ecd2a8541bf03ff
+ oid sha256:793aa52bb071f6372012884f8d464f730e19ee122d0faba28c109d42d303933f
  size 1064
checkpoint-85/trainer_state.json CHANGED
@@ -1,5 +1,5 @@
  {
-   "best_metric": 0.9538461538461539,
+   "best_metric": 0.9545787545787546,
    "best_model_checkpoint": "distilhubert-finetuned-cry-detector/checkpoint-85",
    "epoch": 0.9956076134699854,
    "eval_steps": 500,
@@ -10,21 +10,21 @@
    "log_history": [
      {
        "epoch": 0.9956076134699854,
-       "eval_accuracy": 0.9538461538461539,
-       "eval_f1": 0.953235199825453,
-       "eval_loss": 0.13410410284996033,
-       "eval_precision": 0.9543176332312968,
-       "eval_recall": 0.9538461538461539,
-       "eval_runtime": 5.9191,
-       "eval_samples_per_second": 230.61,
-       "eval_steps_per_second": 28.89,
+       "eval_accuracy": 0.9545787545787546,
+       "eval_f1": 0.9543924849872077,
+       "eval_loss": 0.13782845437526703,
+       "eval_precision": 0.9543323172404159,
+       "eval_recall": 0.9545787545787546,
+       "eval_runtime": 6.1437,
+       "eval_samples_per_second": 222.18,
+       "eval_steps_per_second": 27.834,
        "step": 85
      }
    ],
    "logging_steps": 500,
-   "max_steps": 850,
+   "max_steps": 340,
    "num_input_tokens_seen": 0,
-   "num_train_epochs": 10,
+   "num_train_epochs": 4,
    "save_steps": 500,
    "stateful_callbacks": {
      "EarlyStoppingCallback": {
checkpoint-85/training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
+ oid sha256:a7bd8e997c05f23ae464180f8e5e9f881e9c68322e4fcd3f855e79e2e5d54fbb
  size 5240
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:937ffa1cb79f16f4e7753ea842c3e32fe34e525562027e5b33ce48fc4b308a82
+ oid sha256:02108e5b5b6258f37f5cf8b56d34f77aa8b483e760491525d71597e196663a68
  size 94763496
runs/Sep06_15-54-59_ubumarcos/events.out.tfevents.1725630900.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1acef9821029fd1212d748efd26c746ef609728916bb04756b301a4027aed9d7
+ size 5850
runs/Sep06_17-32-26_ubumarcos/events.out.tfevents.1725636747.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:89395943d8de67b3553472ad01aa836188c676359cc9b10e225dd488dbb1d7ac
+ size 9238
runs/Sep08_21-41-41_ubumarcos/events.out.tfevents.1725824502.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:89c619e470e68f874e4cb0664e10f8a4fdf35a1bdd0c2527edd075706f92d81a
+ size 5851
runs/Sep08_21-41-41_ubumarcos/events.out.tfevents.1725824503.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:90c52b45df2498fc461d9ed2383ef00e8683f52764e29e553f2a7d683c6062eb
+ size 5851
runs/Sep08_21-42-34_ubumarcos/events.out.tfevents.1725824555.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c2a93d1da540cf716d2ca78b8fc4f898ca82630a2e7b3aef2aef2cf70d974709
+ size 7730
runs/Sep08_21-50-57_ubumarcos/events.out.tfevents.1725825058.ubumarcos ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:733dd4196f81f5f0672f6e91223ee12db5b02db4b8f6bfa80103cd6ccffdce87
+ size 8083
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:831297954221509dc3f32e8eaf01789c3f660cf430e186799d52e58ec3b5b334
+ oid sha256:a7bd8e997c05f23ae464180f8e5e9f881e9c68322e4fcd3f855e79e2e5d54fbb
  size 5240