Closed by sunyilgdx 10 months ago
We haven't found a publicly available version of STORIES yet. In our work, we followed the methodology described in the original STORIES paper (Section 5.3) to collect a set of documents of similar size from the CommonCrawl corpus, and used those collected documents as part of our general corpus.
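For anyone who wants to try rebuilding a similar corpus, here is a rough sketch of the kind of filtering involved. It is not our actual pipeline. It assumes the selection in Section 5.3 amounts to ranking CommonCrawl documents by n-gram overlap with a set of target questions and keeping the top-ranked slice. The function names, the whitespace tokenization, the per-n weights, and the `keep_fraction` value are all hypothetical placeholders; please check the paper for the real settings.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def build_query_ngrams(queries, max_n=4):
    """Collect the set of n-grams (n = 1..max_n) seen in any query text."""
    query_ngrams = {n: set() for n in range(1, max_n + 1)}
    for query in queries:
        tokens = query.lower().split()  # assumption: naive whitespace tokenizer
        for n in range(1, max_n + 1):
            query_ngrams[n].update(ngrams(tokens, n))
    return query_ngrams


def overlap_score(doc, query_ngrams, max_n=4):
    """Weighted sum over n of the share of the document's n-grams found in the queries."""
    tokens = doc.lower().split()
    score = 0.0
    for n in range(1, max_n + 1):
        doc_ngrams = ngrams(tokens, n)
        total = sum(doc_ngrams.values())
        if total == 0:
            continue  # document is shorter than n tokens
        shared = sum(c for gram, c in doc_ngrams.items() if gram in query_ngrams[n])
        score += n * shared / total  # assumption: weight longer n-grams more heavily
    return score


def select_top(docs, queries, keep_fraction=0.001, max_n=4):
    """Rank documents by overlap score and keep the highest-scoring fraction."""
    query_ngrams = build_query_ngrams(queries, max_n)
    ranked = sorted(docs, key=lambda d: overlap_score(d, query_ngrams, max_n), reverse=True)
    keep = max(1, int(len(ranked) * keep_fraction))
    return ranked[:keep]
```

In practice `docs` would be streamed from CommonCrawl text extracts rather than held in a list, and scores would be computed in parallel; the sketch only shows the scoring and selection logic.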
How can I download the CC-Stories corpus?