titipata / pubmed_parser

:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
http://titipata.github.io/pubmed_parser/
MIT License
588 stars 168 forks source link

#119 Modify XPath to retreive tables and references. #142

Closed nils-herrmann closed 6 months ago

nils-herrmann commented 6 months ago

Tables

The XPath was changed to parse the tables. Now any table-wrap node under body gets parsed.

References

It seems that some articles contain references with the attribute citation-type instead of publication-type. The new implementation parses also references with the attribute citation-type.