adsabs / ADSIngestParser

Curation parser library
MIT License
0 stars 7 forks source link

SPRINGER: duplicating mathml tags and ignoring tex tags #107

Closed mugdhapolimera closed 1 week ago

mugdhapolimera commented 3 months ago

Springer parsing has both mathml and tex in most instances. We want to take mathml over tex (but tex if there isn't any mathml). As it is now, it is producing doubles of the math things.

See /proj/ads/abstracts/ingest/ADSManualParser/SoPh.test and AstL.test (%I gives the location of the files used for that run)