apwic
/

indosum-base-0

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

indosum-base-0

This model is a fine-tuned version of LazarusNLP/IndoNanoT5-base on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.7170
Rouge1: 72.8591
Rouge2: 65.7981
Rougel: 69.8759
Rougelsum: 72.0557
Gen Len: 99.276

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.001
train_batch_size: 16
eval_batch_size: 32
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5.0

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
1.2132	1.0	892	0.7742	67.4414	59.7409	64.517	66.4918	94.092
0.686	2.0	1784	0.6673	70.2138	62.8202	67.1553	69.3063	100.2933
0.491	3.0	2676	0.6274	71.2142	63.9943	68.2722	70.2971	100.944
0.343	4.0	3568	0.6469	71.7114	64.489	68.7214	70.7949	98.8227
0.2059	5.0	4460	0.7170	72.5364	65.2519	69.5637	71.6884	98.9053

Framework versions

Transformers 4.40.2
Pytorch 2.3.1+cu121
Datasets 2.20.0
Tokenizers 0.19.1

Downloads last month: 2

Safetensors

Model size

248M params

Tensor type

F32

·

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for apwic/indosum-base-0

Base model

LazarusNLP/IndoNanoT5-base

Finetuned

(53)

this model

Evaluation results

Metadata error: specify a dataset to view leaderboard