dennlinger / summaries

A toolkit for summarization analysis and aspect-based summarizers
MIT License
11 stars 0 forks source link

Duplication detection #38

Closed dennlinger closed 1 year ago

dennlinger commented 1 year ago

This PR adds functions to analyze the existence of duplications in a dataset. Notably, this does not attempt to remove such duplicates, since the exact procedure is ambiguous (see #34).

In summary, there are the following functions and additions: