bwahyuh commited on
Commit
f669ade
1 Parent(s): c556d85

Training complete

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
- license: apache-2.0
3
- base_model: indolem/indobertweet-base-uncased
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # gemash
20
 
21
- This model is a fine-tuned version of [indolem/indobertweet-base-uncased](https://huggingface.co/indolem/indobertweet-base-uncased) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.8641
24
- - Accuracy: 0.7867
25
- - Precision: 0.8007
26
- - Recall: 0.7963
27
- - F1: 0.7984
28
 
29
  ## Model description
30
 
@@ -55,10 +55,10 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
58
- | 1.3221 | 1.0 | 169 | 1.0269 | 0.6067 | 0.6758 | 0.6354 | 0.6353 |
59
- | 0.7301 | 2.0 | 338 | 0.7482 | 0.745 | 0.7819 | 0.7463 | 0.7601 |
60
- | 0.2799 | 3.0 | 507 | 0.8165 | 0.7683 | 0.7804 | 0.7911 | 0.7807 |
61
- | 0.0876 | 4.0 | 676 | 0.8641 | 0.7867 | 0.8007 | 0.7963 | 0.7984 |
62
 
63
 
64
  ### Framework versions
 
1
  ---
2
+ license: mit
3
+ base_model: indolem/indobert-base-uncased
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
18
 
19
  # gemash
20
 
21
+ This model is a fine-tuned version of [indolem/indobert-base-uncased](https://huggingface.co/indolem/indobert-base-uncased) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.8077
24
+ - Accuracy: 0.7517
25
+ - Precision: 0.7645
26
+ - Recall: 0.7614
27
+ - F1: 0.7619
28
 
29
  ## Model description
30
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
57
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
58
+ | 1.5073 | 1.0 | 169 | 1.3572 | 0.43 | 0.4010 | 0.3978 | 0.3732 |
59
+ | 1.1536 | 2.0 | 338 | 1.0170 | 0.6083 | 0.6592 | 0.5946 | 0.5925 |
60
+ | 0.7413 | 3.0 | 507 | 0.8618 | 0.7183 | 0.7324 | 0.7277 | 0.7269 |
61
+ | 0.446 | 4.0 | 676 | 0.8077 | 0.7517 | 0.7645 | 0.7614 | 0.7619 |
62
 
63
 
64
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "indolem/indobertweet-base-uncased",
3
  "architectures": [
4
  "BertForSequenceClassification"
5
  ],
@@ -7,7 +7,6 @@
7
  "bos_token_id": 0,
8
  "classifier_dropout": null,
9
  "eos_token_ids": 0,
10
- "gradient_checkpointing": false,
11
  "hidden_act": "gelu",
12
  "hidden_dropout_prob": 0.1,
13
  "hidden_size": 768,
 
1
  {
2
+ "_name_or_path": "indolem/indobert-base-uncased",
3
  "architectures": [
4
  "BertForSequenceClassification"
5
  ],
 
7
  "bos_token_id": 0,
8
  "classifier_dropout": null,
9
  "eos_token_ids": 0,
 
10
  "hidden_act": "gelu",
11
  "hidden_dropout_prob": 0.1,
12
  "hidden_size": 768,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:099c69f3b8fddff8e129621b0bdbd04958a183afd0e65e5a23da126f4ae40ae0
3
  size 442271748
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d209abe6c3c3a187ed4c4ee0c41d9ea9625426208e5ace06dbb22031e9240f0
3
  size 442271748
runs/Jun28_09-59-44_6ad905270735/events.out.tfevents.1719568785.6ad905270735.326.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a9bf3e090a91a24f726018f935174a228af0566412cf3def6297716ed34b2c1
3
+ size 8240
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b893381d97d147abaca2a8bfb071547abe80fd8d6a0cacfdbc14bac157593135
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0acaedf1bcdf5772bea4c3ed4fb82df6299f9566f3396d15eff7acef3183a597
3
  size 5048
vocab.txt CHANGED
The diff for this file is too large to render. See raw diff