adsabs / ADSIngestParser

Curation parser library
MIT License
0 stars 7 forks source link

Mononyms aren't being handled correctly by the utils.AuthorNames module in some cases #88

Closed seasidesparrow closed 6 months ago

seasidesparrow commented 6 months ago

Describe the bug In certain records from Elsevier where authors with mononyms exist, the author names are being returned as null by Elsevier parser, so the author block in the ingest data record may have an affiliation, orcid, email address, and other attributes, the name will be missing. This will result in an empty author name field in the concatenated author string (classic %A field), and an author initial of "." in the bibcode if the individual is the first author.

To Reproduce Parse the file /proj/ads/abstracts/data/ELS/CONSYN.GEO.new/2405-8440/S2405844023X00024/S2405844023001676/S2405844023001676.xml with Elsevier parser. The resulting entry in the ingest data model will have the following as first author:

""" "authors": [ { "affiliation": [ { "affPubRaw": "Pediatric Surgery Division, Department of Surgery, Faculty of Medicine, Public Health and Nursing, Universitas Gadjah Mada/Dr. Sardjito Hospital, Yogyakarta 55281, Indonesia" } ], "attrib": { "email": "drgunadi@ugm.ac.id", "orcid": "0000-0002-4707-6526" } }, """

Additional context Add any other context about the problem here.