Update README.md
README.md
CHANGED
@@ -50,7 +50,7 @@ You can use this model directly with a pipeline for masked language modeling:
 
 ## Training data
 
-The RoBERTa model was pretrained on the reunion of the following datasets:
+The RoBERTa Hindi model was pretrained on the reunion of the following datasets:
 - [OSCAR](https://huggingface.co/datasets/oscar) is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.
 - [mC4](https://huggingface.co/datasets/mc4) is a multilingual colossal, cleaned version of Common Crawl's web crawl corpus.
 - [IndicGLUE](https://indicnlp.ai4bharat.org/indic-glue/) is a natural language understanding benchmark.
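For context, the masked-language-modeling usage that the hunk header refers to ("You can use this model directly with a pipeline for masked language modeling") would look roughly like the sketch below. This is a minimal illustration, not taken from the diff: the model id `flax-community/roberta-hindi` is an assumption, since the diff does not name the repository.

```python
# Minimal sketch of fill-mask usage via the Hugging Face pipeline API.
# NOTE: the model id "flax-community/roberta-hindi" is an assumption;
# substitute the actual repository id of the RoBERTa Hindi model.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="flax-community/roberta-hindi")

# Use the tokenizer's own mask token so the example works regardless of
# whether the checkpoint expects "<mask>" or "[MASK]".
text = f"मुझे उनसे बात करना {fill_mask.tokenizer.mask_token} लगा"

# Each prediction is a dict with "token_str", "score", and "sequence" keys.
for prediction in fill_mask(text):
    print(prediction["token_str"], prediction["score"])
```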