-
Please check if it satisfies the following requirements:
* It is a small project satisfying a particular scenario
* It is related to Aspose APIs
* It takes 8 hours or less for us to complete
* …
-
http://en.wikipedia.org/wiki/DjVu
DjVu is a computer file format designed primarily to store scanned documents.
-
**Overview of feature request**
Mirador 3 includes an internal search function, but requires a query endpoint. Determine the best way to add this to Islandora.
The UT Scarborough islandora …
-
Hi, I ran into some difficulties with the installation of the MSM.
I precisely followed the compilation instructions and got to the point where I had to do the make install for the DiscreteOpt, FastPDlib…
-
If you can reach Thomas Breuel, ask him to license his spec under an open source license, preferably [CC0](https://creativecommons.org/publicdomain/zero/1.0/).
-
### Your Feature Request
Hello,
I'm using tesseract to OCR scans of German fraktur newspapers and fraktur documents from NARA. Usually the scans have a size of up to 1-2 GB. OCR with tesseract usu…
-
The pdf output is not correct for Devanagari script when using the 3.2.3 experimental version for tesseract 4.0.0alpha.
Please see attached zip file with input image, text, hocr and pdf output.
…
-
As of a few weeks ago, the Conestoga transcription platform has separate fields for page numbers (as shown in the top-right corner of the image below).
There are, as you can see, two fields. Most pages w…
-
ocrmypdf uses excessive memory for files with very high page counts (hundreds), enough that it might consume all available temporary storage on small devices, e.g. a 100 MB PDF produces >2GB of intermediat…
-
Tesseract Version: v5.0.0-alpha.20190623
Platform: Windows 10 64-bit
Current Behavior: For the Thai language (almost) every individual character in hOCR output is a word
Expected Behavior: Words (…
FrkBo updated
2 weeks ago