mwilliamson / python-mammoth

Convert Word documents (.docx files) to HTML
BSD 2-Clause "Simplified" License
785 stars 121 forks source link

Parsing issue with space #102

Closed we29758143 closed 3 years ago

we29758143 commented 3 years ago

I found out there is a mismatch while parsing bullet point, here is an example:

My original document:

image

Parsing document

image

There is a space between each section and its related sentence. However, there is a missing space in "Section 4". Can you tell me where can I fix it?

mwilliamson commented 3 years ago

Could you provide a minimal example document, as well as the expected and generated HTML (as text rather than images)?

mwilliamson commented 3 years ago

Closing since I don't think I can investigate without a document to look at.