titipata / pubmed_parser

A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
http://titipata.github.io/pubmed_parser/
MIT License

#110 Parse text inside journal-title nodes. #143

Closed nils-herrmann closed 1 month ago

nils-herrmann commented 1 month ago

<italic> can appear as a child element of <journal-title>:

<journal-title>
    <italic>
        In Vivo
    </italic>
    Models of Inflammation
</journal-title>

The complete text, including the text inside child elements, can be extracted with "".join(node.itertext()).
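As a minimal sketch of the difference, the snippet below parses the example above with the standard-library `xml.etree.ElementTree` (an assumption made to keep the example self-contained; `itertext()` is also available on lxml elements). The helper name `journal_title_text` and the whitespace normalization are illustrative additions, not part of this PR.

```python
import xml.etree.ElementTree as ET

# The journal-title example from this PR, with <italic> as a child element.
XML = """<journal-title>
    <italic>
        In Vivo
    </italic>
    Models of Inflammation
</journal-title>"""


def journal_title_text(node):
    """Hypothetical helper: join all text under `node`, including child elements.

    The whitespace collapsing is an assumption added here so that the
    indentation of the XML does not leak into the result.
    """
    raw = "".join(node.itertext())
    return " ".join(raw.split())


node = ET.fromstring(XML)

# node.text only holds the text before the first child element,
# which here is just indentation whitespace.
direct_text = (node.text or "").strip()

# itertext() walks the element and its descendants in document order.
full_text = journal_title_text(node)

print(repr(direct_text))
print(repr(full_text))
```

Without the whitespace step, `"".join(node.itertext())` returns the same words but keeps the newlines and indentation from the source XML, so some form of normalization may be needed depending on the input.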