Closed shm007g closed 2 years ago
Thank you for the report.
This is due to new features that require stitching docx runs together. Inside the document, Word breaks up words based on spellcheck tests, revision times, etc., so a paragraph might look like:
<w:p><w:r>Tw</w:r>o <w:r>w</w:r><w:r>or</w:r><w:r>d</w:r><w:r>s</w:r></w:p>
Recent versions of Docx2Python update this to
<w:p><w:r>Two words</w:r></w:p>
This allows for a lot of code simplification and also for newer features like text replacement. BUT, it does slow things down.
If you're looking for a fast, simple export, python-docx2text might suit your needs.
Thank you again.
-Shay
I run the new version of docx2python for my files, it run slower than last version I use.
I post my record here. I cost far too more times than the last version I use.
Machine: macmini 2018
test files
New Version (Need Python3.7)
Old Version