dspinellis / alexandria3k

Local relational access to openly-available publication data sets
GNU General Public License v3.0
79 stars 14 forks source link

Add function to correctly retrieve ORCID #39

Closed BasVerlooy closed 6 months ago

BasVerlooy commented 6 months ago

After using the dataset it was noticed that sometimes the ORCID comes with the URL prefixed like this:

Identifier Source="ORCID">https://orcid.org/0000-0002-4314-996X</Identifier> But sometimes it comes without URL like this: <Identifier Source="ORCID">0000-0001-4314-996X</Identifier>

With this change the ORCID is correctly retrieved from the XML files

dspinellis commented 6 months ago

Thanks! Law of Big Data Analytics: "Any sufficiently large data set will contain some garbage." https://twitter.com/CoolSWEng/status/1586626308330725378 https://twitter.com/CoolSWEng/status/1192147127281111040