The following datasets were built from the CLERC/document collection, with everything in the CLERC/generation test set removed.