NatLibFi / RecordManager

A metadata record management system written in PHP, intended to be used in conjunction with VuFind or another Solr-based discovery interface. It provides import, export, harvesting (OAI-PMH), normalization, deduplication and Solr index update functionality with support for multiple metadata formats. It also includes an OAI-PMH provider that can be used to access the data stored in the RecordManager database. Functionality is driven by simple command-line programs for easy automation.
GNU General Public License v2.0

New Option for harvest to not pull deleted #84

Closed by bpalme 2 years ago

bpalme commented 3 years ago

Hi, it would be nice to add an option (or possibly automatic behavior when reharvesting?) to the SierraAPI harvester so that it does not pull deleted records. When we do a full reharvest, we have to churn through 300K deleted records. Someday I'll try to do a PR.

Thanks, Brent

EreMaijala commented 2 years ago

I've added automatic skipping of deleted records during reharvest. I hope this helps!