dennlinger / summaries

A toolkit for summarization analysis and aspect-based summarizers
MIT License
11 stars 0 forks source link

Determine whether pre-splitting paragraphs is the right choice for temporal tagging #24

Open dennlinger opened 2 years ago

dennlinger commented 2 years ago

Given that we might want to add in temporal information at a later stage, it could be helpful to have the entire text together at that stage, just so we get any potential references/time anchors, which Heideltime could use to determine actual normalized results.

Best would be to check with a quick benchmark on sample data, to see how meaningful the difference would be. Schematically, it is much easier to determine the actual paragraphs first, and then individually process them.