Pub history parsing - Githubissues

gnott commented 5 years ago

In reference to issue https://github.com/elifesciences/issues/issues/4342 for preprint data.

Raw parser extracts additional types of basic tags.

parseJATS.pub_history() returns a list of OrderedDict(), one item for each <event> tag.

I followed the style of how we extract existing tags regarding the date values and uri. The raw content of the <event-desc> tag is available (which is useful in other libraries).

I threw in event_desc_html because the parser is good at converting to HTML. It didn't seem to be worth creating another function for that, and maybe the HTML is useful elsewhere too.

coveralls commented 5 years ago

Coverage increased (+0.004%) to 99.577% when pulling 319a17b6d35c3bb131eb24d83bd5339b32a8976f on pub-history into 9fc6b0fe1210ea9812d0f339dee389dec9d9c550 on develop.

coveralls commented 5 years ago

Coverage increased (+0.004%) to 99.577% when pulling c9ec1c158529e9f8cf6d17ed20008e76355eb046 on pub-history into 9fc6b0fe1210ea9812d0f339dee389dec9d9c550 on develop.

gnott commented 5 years ago

Thanks for approval, I will send it along for the next tasks.

elifesciences / elife-tools

Pub history parsing #296