adsabs / ADSIngestParser

Curation parser library
MIT License
0 stars 7 forks source link

ELSEVIER: Maintaining roman numbers #108

Closed mugdhapolimera closed 1 month ago

mugdhapolimera commented 3 months ago

We need to maintain roman numbers and not convert them to regular numbers.

e.g., /proj/ads_abstracts/data/ELS/CONSYN.GEO/ELS.051324/0038-0717/S0038071700X00132/0038071795900365/0038071795900365.xml

mugdhapolimera commented 1 month ago

/proj/ads/abstracts/data/ELS/CONSYN.GEO/ELS.050224/0047-2484/S0047248409X0014X/S0047248409002310/S0047248409002310.xml

/proj/ads/abstracts/data/ELS/CONSYN.GEO/ELS.050224/0047-2484/S0047248409X0014X/S0047248409002310/S0047248409002310.xml

/proj/ads/abstracts/data/ELS/CONSYN.GEO/ELS.052024/0042-207X/S0042207X00X0228X/0042207X60900968/0042207X60900968.xml (number preserved, but bibcode is wrong)

Springer examples: /proj/ads_abstracts/sources/SPRINGER/files.done/JOU=10546/VOL=1988.43/ISU=1-2/ART=BF00153965/10546_2004_Article_BF00153965_nlm.xml

/proj/ads_abstracts/sources/SPRINGER/files.done/JOU=00034/VOL=1990.9/ISU=2/ART=BF01236443/34_2005_Article_BF01236443_nlm.xml

And a couple of others:

/proj/ads_abstracts/data/T+F/TF.061124/unct20.v210.i07/00295450.2024.2346452.xml

/proj/ads_abstracts/data/NATURE/npj/NPJ.020824/JOU=41586/VOL=2006.440/ISU=7082/ART=BF7082xib/41586_2006_Article_BF7082xib_nlm.xml