apwic/indosum-base-1

Generated from Trainer


apwic committed on Jul 19

Commit 1d75a08 • 1 Parent(s): 2600d01

Model save

Files changed (1)

README.md +17 -17

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [LazarusNLP/IndoNanoT5-base](https://huggingface.co/LazarusNLP/IndoNanoT5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7285
-- Rouge1: 71.2709
-- Rouge2: 63.9704
-- Rougel: 68.0718
-- Rougelsum: 70.4345
-- Gen Len: 98.3792
+- Loss: 0.7478
+- Rouge1: 72.0587
+- Rouge2: 64.7973
+- Rougel: 68.9279
+- Rougelsum: 71.3028
+- Gen Len: 99.3765
 ## Model description
@@ -42,8 +42,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.001
-- train_batch_size: 4
-- eval_batch_size: 8
+- train_batch_size: 16
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -51,18 +51,18 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len  |
-|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
-| 1.5428        | 1.0   | 3566  | 0.9651          | 65.543  | 57.2356 | 62.3412 | 64.6125   | 103.0561 |
-| 0.8422        | 2.0   | 7132  | 0.7883          | 68.7622 | 61.187  | 65.4631 | 67.9119   | 95.1615  |
-| 0.6457        | 3.0   | 10698 | 0.7254          | 69.2705 | 61.8962 | 66.1101 | 68.4083   | 102.7557 |
-| 0.4948        | 4.0   | 14264 | 0.6871          | 71.0668 | 63.8176 | 67.9618 | 70.2487   | 100.2109 |
-| 0.348         | 5.0   | 17830 | 0.7285          | 71.2709 | 63.9704 | 68.0718 | 70.4345   | 98.3792  |
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len  |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
+| 1.1904        | 1.0   | 892  | 0.8053          | 65.8257 | 57.6167 | 62.6222 | 65.0027   | 95.8598  |
+| 0.6851        | 2.0   | 1784 | 0.6779          | 67.8889 | 60.0878 | 64.5868 | 66.9914   | 96.2911  |
+| 0.4856        | 3.0   | 2676 | 0.6460          | 70.9241 | 63.6363 | 67.8555 | 70.153    | 96.9212  |
+| 0.3358        | 4.0   | 3568 | 0.6565          | 69.9002 | 62.4    | 66.5928 | 69.0347   | 101.8745 |
+| 0.1973        | 5.0   | 4460 | 0.7478          | 72.0587 | 64.7973 | 68.9279 | 71.3028   | 99.3765  |
 ### Framework versions
 - Transformers 4.40.2
-- Pytorch 2.3.0+cu121
-- Datasets 2.19.1
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
 - Tokenizers 0.19.1
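
The step counts in the two training-results tables can be cross-checked against the batch-size change. The following is a minimal sketch, assuming a single device, no gradient accumulation, and that the last partial batch of each epoch is kept; the README states none of these. `example_range` is a hypothetical helper written for this check, not part of Transformers.

```python
def example_range(batch_size: int, steps_per_epoch: int) -> tuple[int, int]:
    """Dataset sizes n for which ceil(n / batch_size) == steps_per_epoch."""
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


# Values copied from the diff: epoch 1.0 ends at step 3566 (old) and 892 (new).
old_low, old_high = example_range(batch_size=4, steps_per_epoch=3566)
new_low, new_high = example_range(batch_size=16, steps_per_epoch=892)

# Both runs are consistent with the same training set if the ranges overlap.
overlap = (max(old_low, new_low), min(old_high, new_high))

print("old run:", (old_low, old_high))
print("new run:", (new_low, new_high))
print("overlap:", overlap)
```

Under those assumptions both runs fit a training set of 14261 to 14264 examples, so the roughly 4x drop in steps per epoch matches the 4x larger train batch rather than a change in the data.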