-
Importing via `mamoth.js` is going to always have issues as many of the docx styles and blocks are not supported.
See https://github.com/mwilliamson/mammoth.js/
Unfortunately, something more com…
-
### Question
... drawing_blip = element.xpath(".//a:blip")
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "src/lxml/etree.pyx", line 1600, in lxml.etree._Element.xpath
File "src/lxml/xp…
-
**Is your feature request related to a problem? Please describe.**
docx is a common document file type and it's much easier to parse than pdf (even for the most basic text extraction). Hope someone c…
-
Develop a formatter to parse PDF and DOCX files, extract text and tables while handling complex layouts.
- [ ] Research methods of text extraction from PDF and DOCX.
- [ ] Implement Basic Parsing …
-
### question
when i use WordFormatOption() function to define a converter,i found that i cannot get the picture info ,i see the code of class SimplePipeline,l found that there is no property to set g…
-
Hi! 👋🏻 The author of [docx-template](https://github.com/xamgore/docx-template) here.
I am integrating your crate with mine, as I believe you've got a pretty convenient API for building DOM nodes 👍🏻…
-
**Explain the problem.**
I have observed that in the `word/document.xml` file of the definition_list.docx test document — available at [test/docx/definition_list.docx](https://github.com/jgm/pandoc…
-
Requests:
- [ ] No caption in all .docx exports
- [ ] Titles are left-justified (not centered with table)
- [ ] Linebreaks aren't applied to "header" content.
Nice to Haves:
- [ ] Horizontal pa…
-
### This issue is unique.
- [X] I have used the [search tool](https://github.com/ONLYOFFICE/DesktopEditors/issues?q=) and did not find an issue describing my bug.
### Operating System
Linux (DEB pa…
-
### Bug
The table representation appears misaligned between PDF and DOCX (depending on which one needs alignment, perhaps further formats are affected too).
### Steps to reproduce
The snippet…