-
Ubuntu 16.04.4 LTS (GNU/Linux 4.4.0-122-generic ppc64le)
tesseract -v
tesseract 4.0.0-beta.3-180-gab1f
leptonica-1.76.0
libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : lib…
-
**Describe the bug**
Hi.
Reporting this as requested. I'm not sure if it is the same cause as the other similar issues.
Scanner - Brother DS-640
```ocrmypdf | INFO - New file: /input/test.…
-
* Monica Berti (2019). "Historical Fragmentary Texts in the Digital Age." In ed. Berti, _Digital Classical Philology: Ancient Greek and Latin in the Digital Revolution_, pp. 257–276. Available: https:…
-
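Please reply to this issue to recommend articles or submit job postings. Content must be related to statistics or data science, and acceptance is at the discretion of the editorial team.
Article recommendations may cover academic papers, blog posts, books, tutorials, software, and so on. English-language articles automatically become candidates for translation once the monthly report is published.
Job postings are mainly for positions in academia and industry, and the roles must be related to statistics or data science.
The format for an article recommendation is as follows:
Recommendation: (a few sentences are enough, long or short; opinionated and not too serious)
Recommended by: (real name preferred)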
…
-
Use case from email.
A user gave examples of DOIs for a journal they have access to. They can access the PDFs in the browser but cannot access the full text via API calls. The DOIs that are not accessible via the API appea…
-
@bcglee Thank you for your hard work,
Let's say that I have detected text lines. The issue is that Detectron2 saves the boxes in arbitrary order, without any layout structure.
Setting heuristic rules, example: …
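One simple heuristic of this kind, sketched here only as an illustration and not as the rule the poster had in mind, is to group boxes into rows by the vertical distance between their centers and then read each row left to right. The function name `sort_reading_order`, the `line_tol` parameter, and the plain `(x1, y1, x2, y2)` tuples are all assumptions for this sketch; it does not call any Detectron2 API and only handles single-column layouts.

```python
from typing import List, Tuple

# Hypothetical box representation: (x1, y1, x2, y2) in pixels, origin at top-left.
Box = Tuple[float, float, float, float]


def sort_reading_order(boxes: List[Box], line_tol: float = 0.5) -> List[Box]:
    """Order boxes top-to-bottom, then left-to-right within each row.

    A box joins the current row when its vertical center lies within
    `line_tol` times its own height of the row's mean vertical center.
    """
    rows: List[List[Box]] = []
    # Visit boxes from top to bottom by vertical center.
    for box in sorted(boxes, key=lambda b: (b[1] + b[3]) / 2):
        center_y = (box[1] + box[3]) / 2
        height = box[3] - box[1]
        if rows:
            current = rows[-1]
            row_center = sum((b[1] + b[3]) / 2 for b in current) / len(current)
            if abs(center_y - row_center) <= line_tol * height:
                current.append(box)
                continue
        rows.append([box])
    # Within each row, read left to right by the left edge.
    return [b for row in rows for b in sorted(row, key=lambda b: b[0])]


if __name__ == "__main__":
    detected = [(200, 10, 300, 30), (10, 12, 100, 32), (10, 50, 100, 70)]
    for box in sort_reading_order(detected):
        print(box)
```

Multi-column pages would need an extra step, such as splitting boxes into columns before applying the row grouping.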
-
We need to find a suitable scanning company to do the scanning of the IWPT proceedings. The physical copies are located at different places in continental Europe.
Questions relevant for selection:
…
-
Hi there,
Robert and I have written an OpenMM command line executable which is planned to be in the next release. It can be downloaded here:
https://github.com/rmcgibbo/openmm-cmd.git
I've also ma…
-
I am wondering whether it is possible to use PDF files instead of text files when writing to the db. As far as I can tell, there is no built-in capability in `write_documents_to_db` to handle this. Is it on th…