pascalrai
/

nep-summ-BART

Text2Text Generation

nepali text summary

Inference Endpoints

Model card Files Files and versions Community

pascalrai commited on Feb 29

Commit

e7c012a

•

1 Parent(s): 22131e4

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -96,7 +96,7 @@ The model was pre-trained continuously on a single A10G GPU in an AWS instance f
 <br>Thus, hurts the performance of the Abstractive Summarization task.
 <br>This case is not present in the decoder-only model as all the predicted next token is not seen by the model at all.
-2. We have pre-trained our model with approx 16 GB of data, and testing Classification result on <a href='https://www.kaggle.com/datasets/ashokpant/nepali-news-dataset-large/data'>Nepali News Dataset (Large)</a> with a couple of transformer based Models available on Hugging Face,
 <br> Our models seem to do better than others with an accuracy of 0.58 on validation but,
 <br> It's seen that we still do not have enough data for generalization as Transformer models only perform well on large amounts of pre-trained data compared with Classical Sequential Models.

 <br>Thus, hurts the performance of the Abstractive Summarization task.
 <br>This case is not present in the decoder-only model as all the predicted next token is not seen by the model at all.
+2. We have pre-trained our model with approx 16 GB of data, and testing Classification result on <a href='https://www.kaggle.com/datasets/ashokpant/nepali-news-dataset-large/data'>Nepali News Dataset (Large)</a> with a couple of Nepali transformer based Models available on Hugging Face,
 <br> Our models seem to do better than others with an accuracy of 0.58 on validation but,
 <br> It's seen that we still do not have enough data for generalization as Transformer models only perform well on large amounts of pre-trained data compared with Classical Sequential Models.