-
Uploading a PDF file and trying to OCR (method: simple, format : txt) by pressing button **Convert into Document** opens a new tab with the error **Not Found** and no file is downloaded
![image](h…
-
During some experiments, I noticed that sometimes Japanese characters are not correctly recognized. Not necessarily very complex characters, but simple and commonplace characters such as 津 gets recogn…
-
I used the following command:
`python3 -m manga_translator --mode batch -v --translator=none --inpainter=none --save-text -i `
This command generates the text files I need, but it still creates a ne…
-
It may be worth trying some alternative OCR libraries, as discussed in this article: https://www.statcan.gc.ca/en/data-science/network/character-recognition
Might be a good idea to have these alter…
-
如题
-
请问在训练ocr的时候,训练的数据必须和ocr识别结果是一样的结构吗?(特指换行)比如ocr识别结果如下:
“世界人民
大团结万岁”
是两行,我的构建数据的时候能不能是“世界人民大团结万岁” 一行呢? 这个换行是否为必须,我看原始数据里面有好多符号,难道换行是被要求的吗
-
### Title of the resource
OCR with Google Vision API and Tesseract
### Resource type
External Resource
### Authors, editors and contributors
Isabelle Gribomont, Liz Fischer, Ryan Cordell, Clemens…
-
2024-07-08 18:00:06,589-INFO:选定的策略:['qat_dis']
回溯(最近一次调用最后一次):
文件“run.py”,第 171 行,在
main()中
文件“run.py”,第 164 行,在 main
ac.compress() 中
文件“/root/miniconda3/lib/python3.8/site-packages/paddleslim/a…
-
### Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
### Branch name
main
### Commit ID
de61009
### Other environment information
_No respons…
-
To repro, create a custom pipeline config with only one step: ocr-pdf. Then try to OCR a pdf. You get an empty download.txt.
My config looks like this:
![image](https://github.com/Stirling-Tools…