Open acc-galenicum opened 2 years ago
Ok, self-response in case anyone wonders: I missed the part of the paper where it explains it. The extractive summary is created based on the highlights or the abstractive summary selecting the sentences from the text which maximize the ROUGE metric.
Excuse me, may I ask a question?
Hi, This might be a dumb question but I am not getting it.
This model is supposed to perform an extractive summarization process. But when I look at the raw data (cnn_stories), they provide a text with some highlights at the end (I assume this is the summary), but the problem is this highlights do not belong to the original text, so I don't understand the raw data.
To put a specific example I attach a story file. 00a308681faf9c82a0e62a89b21fcdefb84b88fa.txt
Anyone can help me out with this? Thanks in advance