CottageLabs / LanternPM

Lantern meta repository for product management

Author manuscript detection via XML doesn't work in some cases where it should #120

Closed emanuil-tolev closed 7 years ago

emanuil-tolev commented 8 years ago

The article PMC4978464 (fulltext XML: http://www.ebi.ac.uk/europepmc/webservices/rest/PMC4978464/fullTextXML) had its author manuscript status detected via the HTML rather than the XML, according to the note:

Checked author manuscript status in EUPMC, returned Y_IN_EPMC_SPLASHPAGE

Expected note:

Checked author manuscript status in EUPMC, returned Y_IN_EPMC_FULLTEXT

Something's off with the XML author manuscript detection: I can see pub-id-type="manuscript" in the fulltext XML, and that's the attribute we're supposed to be looking for.

emanuil-tolev commented 8 years ago

Proposed a solution, though, and it wasn't that hard. Can't believe I didn't think of it before, but it's hard to spot without a specific failing example. I must have seen that code over 100 times in the last several months and never thought of the single quote.
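
For anyone hitting something similar, here is a rough sketch of a quote-agnostic check. It assumes the failure came from matching the attribute as a literal string with one quote style while the XML used the other; that is a guess based on the comment above, not a copy of the actual Lantern fix. The function names and the sample XML (including the ID EXAMPLE1) are made up for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Accepts pub-id-type="manuscript" and pub-id-type='manuscript'.
# The backreference \1 requires the closing quote to match the opening one.
MANUSCRIPT_ATTR = re.compile(r"""pub-id-type\s*=\s*(["'])manuscript\1""")


def has_manuscript_id_regex(xml_text: str) -> bool:
    """String-level check that tolerates either quote style."""
    return MANUSCRIPT_ATTR.search(xml_text) is not None


def has_manuscript_id_parsed(xml_text: str) -> bool:
    """Parser-level check: quote style is irrelevant once the XML is parsed."""
    root = ET.fromstring(xml_text)
    return any(
        el.get("pub-id-type") == "manuscript" for el in root.iter("article-id")
    )


# Hypothetical sample documents, identical except for the quote style.
double_quoted = (
    '<article><front><article-meta>'
    '<article-id pub-id-type="manuscript">EXAMPLE1</article-id>'
    '</article-meta></front></article>'
)
single_quoted = double_quoted.replace('"', "'")

# A naive literal match only finds one of the two forms.
naive_hit_double = 'pub-id-type="manuscript"' in double_quoted
naive_hit_single = 'pub-id-type="manuscript"' in single_quoted
```

Parsing the XML is the more robust of the two options, since it also copes with extra whitespace and attribute ordering; the regex is closer to a drop-in replacement for a plain string search.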