adsabs / ADSImportPipeline

Data ingest pipeline for ADS classic->ADS+
GNU General Public License v3.0

unique DOIs in the SOLR document #274

Closed aaccomazzi closed 1 year ago

aaccomazzi commented 1 year ago

Agreed, except for the case-insensitive part of it. DOIs are case-insensitive, and although people are usually careful to preserve the case as they propagate the metadata, we could still receive case-varying versions of the same DOI, e.g. `10.48550/arXiv.2208.05471` and `10.48550/arxiv.2208.05471`. We want just one of these to appear in our list. The code in the PR takes care of this.
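
For illustration only, here is a minimal sketch of case-insensitive de-duplication. This is not the code from the PR; the function name `unique_dois` is hypothetical, and keeping the first spelling encountered is an assumption about the desired behavior.

```python
def unique_dois(dois):
    """Drop case-insensitive duplicates, keeping the first spelling seen.

    Hypothetical helper for illustration; not taken from the pipeline code.
    """
    seen = set()      # lowercased DOIs already emitted
    unique = []       # result, in original order and original case
    for doi in dois:
        cleaned = doi.strip()
        key = cleaned.lower()   # DOIs compare case-insensitively
        if key and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique


dois = ["10.48550/arXiv.2208.05471", "10.48550/arxiv.2208.05471"]
print(unique_dois(dois))
```

With the two DOIs from the example above, only the first spelling is kept. Lowercasing every stored DOI instead would also work, at the cost of losing the publisher's original casing.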