Alex-Fabbri / Multi-News

Large-scale multi-document summarization dataset and code
Other
276 stars 53 forks source link

Questions about experiment #20

Closed AbnerCode closed 4 years ago

AbnerCode commented 4 years ago

Hi, Alex, Thanks for your public dataset. However, I have some questions. As mentioned by your answer in another issue, there are some different number of multi-document in the corpus, so I want to know the details of the experiment. When you are training, what numbers of documents will you use?

Alex-Fabbri commented 4 years ago

We will use all the documents but truncate to 500 tokens by taking the first tokens from each document. We provide more details in Section 6.3 of the paper.

AbnerCode commented 4 years ago

Thanks for your help!