adsabs / ADSIngestParser

Curation parser library
MIT License

Copernicus parser is not capturing <eissn> correctly #85

Closed. seasidesparrow closed this issue 6 months ago

seasidesparrow commented 6 months ago

Describe the bug
Copernicus XML holds the electronic ISSN in an <eissn> tag. The parser checks for the <eissn> tag, but then extracts the value from the <issn> tag instead. As a result, the electronic ISSN is populated with whatever is stored in <issn> in the XML document.

To Reproduce
Try parsing any of /proj/ads/abstracts/data/EGU/EGU.012924/wcd*

Additional context
See lines 60-61 in adsingestp/parsers/copernicus.py: https://github.com/adsabs/ADSIngestParser/blob/0938a1ce5bd01f0b4ebefe67e2d69e75b5f1d32d/adsingestp/parsers/copernicus.py#L60
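
For illustration, here is a minimal standalone sketch of the kind of mismatch described in this report. It is not the code from copernicus.py: the function names, the sample XML layout, and the placeholder ISSN values are all hypothetical, and it uses the standard library's xml.etree.ElementTree, which may not be what the parser itself uses.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; real Copernicus XML may be structured differently.
SAMPLE = """<article>
  <journal>
    <issn>1111-2222</issn>
    <eissn>3333-4444</eissn>
  </journal>
</article>"""


def parse_issns_buggy(xml_text):
    """Reproduce the reported pattern: test for <eissn>, read from <issn>."""
    root = ET.fromstring(xml_text)
    issns = []
    if root.find(".//issn") is not None:
        issns.append(("print", root.find(".//issn").text))
    if root.find(".//eissn") is not None:
        # Bug: the condition checks <eissn>, but the value comes from <issn>.
        issns.append(("electronic", root.find(".//issn").text))
    return issns


def parse_issns_fixed(xml_text):
    """Read each value from the same tag that was tested for."""
    root = ET.fromstring(xml_text)
    issns = []
    for tag, kind in (("issn", "print"), ("eissn", "electronic")):
        node = root.find(f".//{tag}")
        if node is not None and node.text:
            issns.append((kind, node.text.strip()))
    return issns


if __name__ == "__main__":
    print(parse_issns_buggy(SAMPLE))
    print(parse_issns_fixed(SAMPLE))
```

With this sample, the buggy version reports the <issn> value for both the print and electronic entries, while the fixed version reports the <eissn> value for the electronic entry. Looking up each node once and reading from that same node avoids the tag name appearing in two places that can drift apart.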

seasidesparrow commented 6 months ago

Fixed by #86