pascalrai committed
Commit 408b77f
1 parent: 5af1940

Update README.md

Files changed (1): README.md +4 -1
README.md CHANGED
@@ -98,7 +98,10 @@ The model was pre-trained continuously on a single A10G GPU in an AWS instance f
 
 2. We have pre-trained our model on approx. 16 GB of data and tested classification results on the <a href='https://www.kaggle.com/datasets/ashokpant/nepali-news-dataset-large/data'>Nepali News Dataset (Large)</a> against a couple of Nepali transformer-based models available on Hugging Face.
 <br> Our model seems to do better than the others, with an accuracy of 0.58 on validation, but
-<br> it seems we still do not have enough data for generalization, as transformer models only perform well with large amounts of pre-training data compared with classical sequential models.
+<br> there could be two reasons for this:
+
+- There is still room for improving the quality of the data.
+- We still do not have enough data for generalization, as transformer models only perform well with large amounts of pre-training data compared with classical sequential models.
 
 #### Authors:
 
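
The classification evaluation mentioned in the diff above can be sketched with the `transformers` library. This is a minimal sketch, not the script behind the reported 0.58 validation accuracy: the checkpoint id `pascalrai/nepali-bert` and the label count are hypothetical placeholders, and loading and fine-tuning on the Kaggle dataset are omitted.

```python
# Minimal sketch: scoring one headline with a Nepali transformer checkpoint.
# MODEL_ID and NUM_LABELS are placeholders, not names from this repository.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_ID = "pascalrai/nepali-bert"  # hypothetical checkpoint id
NUM_LABELS = 20                     # placeholder: set to the dataset's category count

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID, num_labels=NUM_LABELS)
model.eval()

# One Nepali headline; the actual evaluation would loop over the validation
# split of the Nepali News Dataset (Large).
headline = "नेपालमा आज नयाँ बजेट सार्वजनिक गरियो"
inputs = tokenizer(headline, return_tensors="pt", truncation=True, max_length=512)

with torch.no_grad():
    logits = model(**inputs).logits
print("predicted class id:", logits.argmax(dim=-1).item())
```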