sentiment_ita

Browse files

Files changed (7) hide show

README.md +14 -45
model.safetensors +1 -1
runs/Nov26_11-54-48_844b425e6119/events.out.tfevents.1700999696.844b425e6119.823.0 +3 -0
runs/Nov26_11-56-01_844b425e6119/events.out.tfevents.1700999763.844b425e6119.1887.0 +3 -0
runs/Nov26_12-03-11_844b425e6119/events.out.tfevents.1701000192.844b425e6119.3787.0 +3 -0
tokenizer.json +2 -16
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -3,26 +3,11 @@ license: mit
 base_model: neuraly/bert-base-italian-cased-sentiment
 tags:
 - generated_from_trainer
-datasets:
-- tweet_sentiment_multilingual
 metrics:
 - accuracy
 model-index:
 - name: sentiment_ita
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: tweet_sentiment_multilingual
-      type: tweet_sentiment_multilingual
-      config: italian
-      split: validation
-      args: italian
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.6790123456790124
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # sentiment_ita
-This model is a fine-tuned version of [neuraly/bert-base-italian-cased-sentiment](https://huggingface.co/neuraly/bert-base-italian-cased-sentiment) on the tweet_sentiment_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3764
-- Accuracy: 0.6790
 ## Model description
@@ -53,42 +38,26 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1000
 - num_epochs: 14
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 1.7769        | 0.57  | 100  | 1.2783          | 0.6204   |
-| 0.9499        | 1.14  | 200  | 0.8864          | 0.6389   |
-| 0.7183        | 1.7   | 300  | 0.8164          | 0.6543   |
-| 0.6378        | 2.27  | 400  | 0.8025          | 0.6821   |
-| 0.565         | 2.84  | 500  | 0.7971          | 0.6790   |
-| 0.4493        | 3.41  | 600  | 0.8769          | 0.6728   |
-| 0.4229        | 3.98  | 700  | 0.8342          | 0.6790   |
-| 0.2503        | 4.55  | 800  | 1.1654          | 0.6728   |
-| 0.2372        | 5.11  | 900  | 1.0713          | 0.6759   |
-| 0.1445        | 5.68  | 1000 | 1.3554          | 0.6883   |
-| 0.1329        | 6.25  | 1100 | 1.5381          | 0.6605   |
-| 0.1023        | 6.82  | 1200 | 1.5506          | 0.6914   |
-| 0.0542        | 7.39  | 1300 | 1.8265          | 0.6852   |
-| 0.0472        | 7.95  | 1400 | 2.0343          | 0.6698   |
-| 0.0227        | 8.52  | 1500 | 2.1335          | 0.6728   |
-| 0.0247        | 9.09  | 1600 | 2.1640          | 0.6605   |
-| 0.0165        | 9.66  | 1700 | 2.1982          | 0.6759   |
-| 0.0069        | 10.23 | 1800 | 2.2789          | 0.6790   |
-| 0.0256        | 10.8  | 1900 | 2.2858          | 0.6883   |
-| 0.0125        | 11.36 | 2000 | 2.3091          | 0.6852   |
-| 0.0051        | 11.93 | 2100 | 2.3238          | 0.6821   |
-| 0.0014        | 12.5  | 2200 | 2.3700          | 0.6883   |
-| 0.0019        | 13.07 | 2300 | 2.3582          | 0.6790   |
-| 0.0038        | 13.64 | 2400 | 2.3764          | 0.6790   |
 ### Framework versions

 base_model: neuraly/bert-base-italian-cased-sentiment
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: sentiment_ita
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # sentiment_ita
+This model is a fine-tuned version of [neuraly/bert-base-italian-cased-sentiment](https://huggingface.co/neuraly/bert-base-italian-cased-sentiment) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6301
+- Accuracy: 0.6903
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 48
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 600
 - num_epochs: 14
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 1.5002        | 1.67  | 100  | 0.9894          | 0.6460   |
+| 0.6959        | 3.33  | 200  | 0.7681          | 0.6726   |
+| 0.5325        | 5.0   | 300  | 0.7615          | 0.6962   |
+| 0.349         | 6.67  | 400  | 0.8867          | 0.6932   |
+| 0.1798        | 8.33  | 500  | 1.1361          | 0.6873   |
+| 0.0983        | 10.0  | 600  | 1.3994          | 0.6962   |
+| 0.0412        | 11.67 | 700  | 1.5411          | 0.7109   |
+| 0.0293        | 13.33 | 800  | 1.6301          | 0.6903   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83d9b56693fd38e852a4c0f1ef9d6c33ee6677fa79c814e716315e2595adbbea
 size 442815492

 version https://git-lfs.github.com/spec/v1
+oid sha256:ab0a2e0fd9c57c0c99c08360c96fb105942e6ab433034627be4e3e0f4b8dca2a
 size 442815492

runs/Nov26_11-54-48_844b425e6119/events.out.tfevents.1700999696.844b425e6119.823.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:58123828c2c333aa6d744be6ee80d497453c676af4227c25d5ae0cd77bdcd02c
+size 4416

runs/Nov26_11-56-01_844b425e6119/events.out.tfevents.1700999763.844b425e6119.1887.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8080cea45d7124c5086551396046e421cf35a449e9daea0b91c9417ce3711e8a
+size 6327

runs/Nov26_12-03-11_844b425e6119/events.out.tfevents.1701000192.844b425e6119.3787.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3f395bb57ab47848434f7b8254355edf139f577e2416c425fd82d2369089cd68
+size 8600

tokenizer.json CHANGED Viewed

@@ -1,21 +1,7 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 512,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Right",
-    "pad_to_multiple_of": null,
-    "pad_id": 0,
-    "pad_type_id": 0,
-    "pad_token": "[PAD]"
-  },
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": null,
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7f673dec4331874cad97a2311edfec227ef5859a924ea306872bc07bc229ea7f
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:257db300ef90ff556ff164ddd8845fca9630f28dba9d5b604292c872107112a0
 size 4155