Closed: jinfengr closed this issue 3 years ago
Hi! Sorry for the long delay.
1/4/5. Yes, it's 300. You can check out the preprocessing script here. The vocabulary is shared and has size 5,000, without filtering out low-frequency tokens (see the default parameters in this file). The vocabulary embeddings are not shared.
2/3. We use a randomly initialized embedding layer rather than pretrained word embeddings, and no positional embeddings.
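In case it helps, here is a minimal PyTorch-style sketch of the setup described above (shared 5,000-token vocabulary, 300-dimensional embeddings, separate randomly initialized tables for the encoder and decoder, no positional embeddings). The class and variable names are only illustrative, not the actual code in this repo:

```python
import torch
import torch.nn as nn

# Values taken from the answer above; names in this sketch are illustrative.
VOCAB_SIZE = 5000  # shared source/target vocabulary, no low-frequency cutoff
EMBED_DIM = 300    # embedding dimension

class Seq2SeqEmbeddings(nn.Module):
    """Separate, randomly initialized embedding tables for the encoder and
    decoder over the same shared vocabulary; no positional embeddings."""

    def __init__(self, vocab_size: int = VOCAB_SIZE, embed_dim: int = EMBED_DIM):
        super().__init__()
        # nn.Embedding draws its weights from N(0, 1) by default, so nothing
        # is loaded from pretrained word vectors.
        self.encoder_embed = nn.Embedding(vocab_size, embed_dim)
        self.decoder_embed = nn.Embedding(vocab_size, embed_dim)

    def forward(self, src_ids: torch.Tensor, tgt_ids: torch.Tensor):
        # Token embeddings only; no positional encoding is added.
        return self.encoder_embed(src_ids), self.decoder_embed(tgt_ids)
```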
Hi Alex,
Thanks a lot for sharing the code and data. I am trying to evaluate on your dataset; however, some details are not mentioned in the paper. I am wondering if you could answer the following questions: