essepuntato / opencitations

OpenCitations provides in RDF accurate citation information harvested from the scholarly literature.
http://opencitations.net
ISC License

DOI with a final "(" #7

Open essepuntato opened 7 years ago

essepuntato commented 7 years ago

Some DOIs retrieved and included in the corpus have a final " (" string (e.g. https://w3id.org/oc/corpus/id/121429) that was somehow extracted by the BEE+SPACIN process. It would be important to understand whether:

  1. it has been addressed and corrected in previous changes of the code
  2. it derives from the results provided by the sources (i.e. PubMed Central and Crossref)
  3. it is still a mistake introduced by the OpenCitations code

harej commented 7 years ago

Related issue: there are a lot of DOIs ending with periods, even though they're not supposed to. For example: https://w3id.org/oc/corpus/br/10172.html
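
Both symptoms (a trailing " (" and a trailing period) could be flagged with one heuristic check. The following is only a minimal sketch, not the BEE+SPACIN code: the function names `clean_doi` and `is_suspicious` and the sample DOIs are hypothetical, and it assumes that a DOI in the corpus should never end in whitespace, a period, a comma, a semicolon, or an opening parenthesis. DOI syntax itself is permissive, so a match is a hint to investigate rather than proof of an error.

```python
import re

# Assumption: none of these characters should legitimately end a DOI
# in the corpus (whitespace, '.', ',', ';', or a dangling '(').
_TRAILING_JUNK = re.compile(r"[\s.,;(]+$")


def clean_doi(raw: str) -> str:
    """Return the DOI with surrounding whitespace and trailing junk removed."""
    return _TRAILING_JUNK.sub("", raw.strip())


def is_suspicious(raw: str) -> bool:
    """True if cleaning would change the stored value."""
    return clean_doi(raw) != raw


if __name__ == "__main__":
    # Made-up DOIs for illustration only; they are not taken from the corpus.
    samples = [
        "10.1000/xyz123 (",    # trailing " (" as described in this issue
        "10.1000/xyz123.",     # trailing period as in the related report
        "10.1000/abc(123)45",  # inner parentheses are left untouched
    ]
    for doi in samples:
        print(repr(doi), "->", repr(clean_doi(doi)), "suspicious:", is_suspicious(doi))
```

Running a check like this over the stored DOIs, and separately over the raw values returned by PubMed Central and Crossref, would help tell apart cases 2 and 3 above: if the raw source value is already suspicious, the problem is upstream.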

davidshotton commented 7 years ago

Relates to Issues 21 and 24 about DOI-to-DOI tables.