-
When i run
`python -m table_ocr.demo https://raw.githubusercontent.com/eihli/image-table-ocr/master/resources/test_data/simple.png`
i get
`pytesseract.pytesseract.TesseractError: (1, 'Error …
-
Hello,
Regarding 0.32.0 Major Enhancements : Removed CLI block on .exe installs to instead use dependency detection allowing Windows users to use CLI programs if installed
Does this mean, for Wi…
-
### Description of the bug | 错误描述
我的机器很差,内存只有40G,怕解析中途内存爆了,在解析一些5000多页的PDF的时候,我会先把PDF切成80页一个的小文件,然后再用MAGIC-PDF去解析。然后一大堆文件中偶尔会看到回显有如下日志这样的找不到图片的错误,一旦出现这样的错误,这个PDF就不会有任何layout或者markdown文件被输出。
不知道是不是…
-
**Describe the bug**
we plan to load historical pdf files into the database and want to make them searchable using OCR workflow, which changes the modification date of the file - hence the importan…
-
### Description of the bug | 错误描述
完成本地部署后提取文档出现错误
2024-11-19 19:01:46.298 | INFO | magic_pdf.pdf_parse_union_core_v2:pdf_parse_union:647 - page_id: 0, last_page_cost_time: 0.0
2024-11-19 19:01:…
-
* google-drive-ocr version: 0.2.6
* Python version: Python 3.12.2
* Operating System: macOS Sonoma 14.2.1
### Question
The token had expired, so I deleted the project and re-created a new proj…
-
**Describe the bug**
Output text from PDFs with columnar text doesn't match logical order in OCR mode. Columns would be mixed together like two halves of a card deck being shuffled.
**Files**
[…
-
# MindSpore OCR 服务化部署功能设计说明书
## 一、修订记录
| ***\*日期\**** | ***\*修订版本\**** | ***\*修改章节\**** | ***\*修改描述\**** | ***\*作者\**** |
| -------------- | ------------------ | ------------------ | ---…
-
Microsoft Windows [Version 10.0.19045.4529]
(c) Microsoft Corporation. All rights reserved.
C:\Users\lenovo\Desktop\python-hutrans-master>python setup.py install --user
Traceback (most recent cal…
-
Dear `ocrs` Contributors,
I hope this message finds you well. I've recently come across your OCR project and am thoroughly impressed with the vision and progress of `ocrs`. The use of machine learn…