salbatarni
/

flan-t5-small-asap_t4_f1_prompt_adherence

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Edit model card

flan-t5-small-asap_t4_f1_prompt_adherence

This model is a fine-tuned version of google/flan-t5-small on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.0578
Rouge1: 84.443
Rouge2: 80.3833
Rougel: 84.4646
Rougelsum: 84.4359
Gen Len: 12.1859

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
No log	1.0	266	0.0936	79.5741	73.0518	79.6566	79.6486	12.0859
0.4018	2.0	532	0.0670	83.6269	79.3655	83.6546	83.6338	12.1887
0.4018	3.0	798	0.0596	83.4438	79.1374	83.4652	83.5119	12.2239
0.0771	4.0	1064	0.0600	84.8381	80.8793	84.8927	84.9041	12.1549
0.0771	5.0	1330	0.0578	84.443	80.3833	84.4646	84.4359	12.1859

Framework versions

Transformers 4.38.2
Pytorch 2.1.2
Datasets 2.18.0
Tokenizers 0.15.2

Downloads last month: 2

Safetensors

Model size

77M params

Tensor type

F32

·

Inference Examples

Text2Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for salbatarni/flan-t5-small-asap_t4_f1_prompt_adherence

Base model

google/flan-t5-small

Finetuned

(297)

this model

Evaluation results

Metadata error: specify a dataset to view leaderboard