-
olá,
desculpem se as issues não são o local apropriado para essa dúvida, mas estou com problemas para usar o parser. eu clonei o repositório, construí o projeto com `mvn package`, e estava tentando…
-
Hi there!
I'd be interested in parsing .docx. Is that of similar scope as this?
Ideally, I'd have a markdown to docx parser at some point:)
-
- Can it create a document by making a copy of an existing template? I've tried this without success:
document = Document('template.docx')
new_parser = HtmlToDocx()
html = sys.argv[…
-
I was trying out the tutorial. However, when partitioning the PDF provided in tutorial, I did not observe that the font-style of the text being stored in the Metadata for the element.
Is the font-s…
-
### Describe the bug
I wanted to use LlamaParse for parsing a set of documents (PDF/Doc/Docx) and index them to be able to ask custom questions to those documents. I performed a bunch of tests where …
-
### 🥰 Feature Description
It would be great to be able to upload documents, and for Lobe to parse them locally and provide answers.
### 🧐 Proposed Solution
Even if those documents aren't stored pe…
-
## Is your feature request related to a problem? Please describe.
When reading a `docx` file, it's really useful to understand where a paragraph is located within a document to create experiences a…
-
Error message:
```Python
Traceback (most recent call last):
File "thesis.py", line 194, in
htmldocx("index-utf8.html")
File "thesis.py", line 58, in htmldocx
new_parser.parse_html_f…
-
I've had the need where I have HTML with various different classess. When converting the HTML to .docx, I needed to map to classses to specific Word styles. I couldn't see an existing way to do it.
…
-
wiki page: "https://en.wikipedia.org/wiki/List_of_public_corporations_by_market_capitalization"
code:
from docx import Document
from htmldocx import HtmlToDocx
import codecs
file = codecs.op…