adsabs / ADSImportPipeline

Data ingest pipeline for ADS classic->ADS+
GNU General Public License v3.0
1 stars 12 forks source link

Avoid removing arXiv records with published counterpart not yet indexed #244

Closed aaccomazzi closed 4 years ago

aaccomazzi commented 4 years ago

Due to the timing of the ADS classic pipelines, it's possible that an arXiv paper will be matched to a published record on its way to being ingested (but not yet fully ingested). When this happens, the corresponding arXiv bibcode disappears from the list of canonical bibcodes even as the corresponding published bibcode is not yet active, leading to a loss of a record for a period of potentially multiple days.

We need to implement a check in the ingest process which prevents this situation from happening.

aaccomazzi commented 4 years ago

Fixed in https://github.com/adsabs/ADSImportPipeline/pull/245