adsabs / ADSIngestParser

Curation parser library
MIT License

Crossref parser is not capturing item_number tag #64

Open seasidesparrow opened 1 year ago

seasidesparrow commented 1 year ago

Describe the bug Some publishers, including Frontiers in..., may supply the page number of an article in the tag <item_number item_number_type="article_number">. The Crossref parser does not read this tag, so page numbers are not being captured for Frontiers in Physics records coming from Crossref. This may not hold for every Frontiers in... journal, as other Frontiers in... Crossref entries do have pagination.

To Reproduce Use the Crossref parser to parse the file /proj/adsnest/ads_completeness/Crossref2/doi/10.3389/./fp/hy/,2/01/8,/00/09/7//metadata.xml. The page number of this article should be 97, but it is missing from the resulting Ingest Data Model object.

Additional context The value of item_number (with item_number_type="article_number") should be captured by the CrossrefParser and added to the Ingest Data Model in pagination["electronicId"] as a string.