savasy's picture
Update README.md
a32cbb4
|
raw
history blame
841 Bytes

This checkpoint has been trained with the Turkish part of the MLSUM dataset where google/mt5 is the main Pre-trained checkpoint. SimpleT5 library is used for training.

Here is the code snippet for training

model = SimpleT5()
model.from_pretrained("mt5","google/mt5-small")

model.train(train_df=train2, # pandas dataframe with 2 columns: source_text & target_text
            eval_df=validation2, # pandas dataframe with 2 columns: source_text & target_text
            source_max_token_len = 512, 
            target_max_token_len = 128,
            batch_size = 8,
            max_epochs = 5,
            use_gpu = True,
            outputdir = "mt5_mlsum_turkish",
            early_stopping_patience_epochs = 0,
            precision = 32
)