scrapinghub / exporters

Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
BSD 3-Clause "New" or "Revised" License
40 stars, 10 forks

Filter for duplicates based on a single field #321

Closed wilfre closed 8 years ago

wilfre commented 8 years ago

When reading from a source that contains duplicates, there is currently no way to remove them. This PR adds a new filter that drops duplicates based on a single field of each item. Some considerations about the filter are