adsabs / ADSIngestParser

Curation parser library
MIT License
0 stars 7 forks source link

fixed subscript, superscript, italics, bold tags for elsevier #101

Closed mugdhapolimera closed 4 months ago

mugdhapolimera commented 4 months ago

https://github.com/adsabs/ADSIngestParser/issues/99

mugdhapolimera commented 4 months ago

@seasidesparrow please note that the changes to base.py did modify the parsing for other parsers like jats. I haven't updated all of them yet, because I'd like your feedback to see if this is working well. Once we settle on a solution, I can modify other files.

seasidesparrow commented 4 months ago

The json output for els_missing_keywords_tag.xml is missing the abstract. The abstract is present in the xml file as the following

<ce:abstract id="abs0010" view="all" class="author"><ce:abstract-sec id="abssec0010" view="all"><ce:simple-para id="abspara0010" view="all">In this issue of <ce:italic>Chem</ce:italic>, Nagib and co-workers report an operationally simple approach for direct C&#x2013;H halogenation of arenes and (hetero)arenes through <ce:italic>in&#160;situ</ce:italic> activation of commercially available PhI(OAc)<ce:inf loc="post">2</ce:inf>. The resulting non-symmetrical iodanes offer chemists new tools for the mild halogenation of arene scaffolds with exquisite selectivity and tunability.</ce:simple-para></ce:abstract-sec></ce:abstract><ce:abstract class="teaser" id="abs0015" view="all"><ce:abstract-sec id="abssec0015" view="all"><ce:simple-para id="abspara0015" view="all">In this issue of <ce:italic>Chem</ce:italic>, Nagib and co-workers report an operationally simple approach for direct C&#x2013;H halogenation of arenes and (hetero)arenes through <ce:italic>in situ</ce:italic> activation of commercially available PhI(OAc)<ce:inf loc="post">2</ce:inf>. The resulting non-symmetrical iodanes offer chemists new tools for the mild halogenation of arene scaffolds with exquisite selectivity and tunability.</ce:simple-para></ce:abstract-sec></ce:abstract>

What we have in the tagged (online) data is:

In this issue of Chem, Nagib and co-workers report an operationally simple approach for direct C-H halogenation of arenes and (hetero)arenes through in situ activation of commercially available PhI(OAc)2. The resulting non-symmetrical iodanes offer chemists new tools for the mild halogenation of arene scaffolds with exquisite selectivity and tunability.