training corpus

#1
by ldwang - opened

Thanks for your paper.
As the paper said "training corpus is a mix of large-scale, medium-quality open-source datasets with permissive licenses."
Would you share details about training corpus, Thanks a lot.

Sign up or log in to comment