-
#### Problem Description
PPStructure misses text that PaddleOCR does not miss
#### Runtime Environment
- OS:
- Paddle:
- PaddleOCR:
#### Reproduction Code
PaddleOCR(lang…
-
/home/spyndling/.local/lib/python3.10/site-packages/matplotlib/projections/__init__.py:63: UserWarning: Unable to import Axes3D. This may be due to multiple versions of Matplotlib being installed (e.g…
-
Generally I would love to have some bounding boxes come back with the text response. Primarily for highlighting locations in the original document where the text got pulled. Not sure exactly how I wou…
-
When I run
`python -m table_ocr.demo https://raw.githubusercontent.com/eihli/image-table-ocr/master/resources/test_data/simple.png`
I get
`pytesseract.pytesseract.TesseractError: (1, 'Error …
-
### Description of the bug
After setting "is_table_recog_enable" to true in magic_pdf.json, a bug appears; when it is set to false, the problem does not occur.
### How to reproduce the bug
Traceback (most recent call last):
File "/data/Miner…
-
### Describe your problem
Thanks for your work. I have deployed the ragflow system on my own server.
However, when I upload a PDF file (2 pages), it takes a long time to parse (more than 300 seconds…
-
I'm getting an error associated with the output sizing of a layer being too small:
It's possible that this is due to an error associated with loading saved weights:
However, after readin…
-
I am currently partitioning a docx file using unstructured with the following input parameters:
```json
{
"filename": "document.docx",
"response_type": "application/json",
"coordinates": fals…
-
Support for exporting table-ocr results to CSV would be nice.
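
For illustration, here is a rough sketch of what such an export could look like. It assumes the OCR'd cell texts are already available as a list of rows; `cells_to_csv` is a hypothetical helper, not part of table_ocr, and it only uses Python's standard `csv` module.

```python
import csv
import io


def cells_to_csv(rows):
    """Serialize a table (list of rows, each a list of cell strings) to CSV text.

    Hypothetical helper for illustration; not part of table_ocr.
    """
    buffer = io.StringIO()
    # lineterminator="\n" avoids the csv module's default "\r\n" line endings
    writer = csv.writer(buffer, lineterminator="\n")
    for row in rows:
        # OCR output often carries stray whitespace around each cell
        writer.writerow([cell.strip() for cell in row])
    return buffer.getvalue()


# Example input standing in for OCR'd cells (made-up data)
table = [["Name", "Qty"], ["Widget, large ", " 3"]]
csv_text = cells_to_csv(table)
print(csv_text)
```

Cells that contain a comma are quoted automatically by `csv.writer`, so the output should load cleanly into a spreadsheet.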
-
We use the config below to get the table OCR, but there is no way to get the hOCR of the image. Can someone add this feature, please?
`
d = os.path.dirname(sys.modules["table_ocr"].__file__)
…