elifesciences / decision-letter-parser

Parse docx file containing decision letter and author response content and produce output in other formats
MIT License
0 stars 0 forks source link

Caption parsing fix and asset xref tag fix #102

Closed gnott closed 4 years ago

gnott commented 4 years ago

Re issue https://github.com/elifesciences/issues/issues/5874

A recent article failed to generate XML from the .docx file, the reasons centred around a figure caption, how it was parsed, and how the asset xref tag highlighting logic was confused by trying to search for a blank figure label in paragraph text.

Code here includes the fixes and two test cases for these situations.