dennlinger / summaries

A toolkit for summarization analysis and aspect-based summarizers
MIT License

Add test set preserving deduplication method #52

Closed dennlinger closed 2 years ago

dennlinger commented 2 years ago

Currently, first-occurrence deduplication primarily keeps the training samples and might discard duplicates that appear later in the test set, which is less desirable.
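To illustrate the difference, here is a minimal sketch of what a test-set-preserving strategy could look like. It assumes samples are hashable and that splits are processed in a fixed order. The function name `dedup_by_priority` and the split names are hypothetical and are not taken from this toolkit's API.

```python
from typing import Dict, Hashable, List, Sequence


def dedup_by_priority(
    splits: Dict[str, Sequence[Hashable]],
    priority: Sequence[str],
) -> Dict[str, List[Hashable]]:
    """Keep each sample only in the first split (by `priority`) that contains it.

    Hypothetical helper for illustration; not the library's implementation.
    """
    seen = set()
    result: Dict[str, List[Hashable]] = {name: [] for name in priority}
    for name in priority:
        for sample in splits[name]:
            if sample not in seen:
                seen.add(sample)
                result[name].append(sample)
    return result


splits = {
    "train": ["a", "b", "c"],
    "validation": ["c", "d"],
    "test": ["a", "e"],
}

# First-occurrence behavior: train is visited first, so test loses "a".
train_first = dedup_by_priority(splits, ["train", "validation", "test"])

# Test-preserving behavior: test is visited first, so train loses "a" instead.
test_first = dedup_by_priority(splits, ["test", "validation", "train"])
```

Under these assumptions, reversing the processing order is enough to leave the test set intact and push all removals into the validation and training splits.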

dennlinger commented 2 years ago

Turned out to be surprisingly easy; solved with #53.