-
Stemming from [an email discussion](https://groups.google.com/g/islandora-dev/c/oPr1ZsJx-HA):
Hypercube currently uses pdftotext to extract text embedded in a PDF OR tesseract to perform OCR on …
-
https://github.com/phuoc-ng/react-pdf-viewer/issues/278
https://github.com/phuoc-ng/react-pdf-viewer/issues/276
https://github.com/phuoc-ng/react-pdf-viewer/issues/263
I get all 3 scenarios here …
-
I've run into an issue with modifying certain pdf files. The issue seems to happen only with pdf files that were generated by a scanner. I'm successfully adding serial numbers to the bottoms of every pag…
-
![image](https://user-images.githubusercontent.com/4609956/199988849-564f073c-51a7-4239-9c96-4508886d45e8.png)
https://tb.plazi.org/GgServer/html/0F2A87F24B7CFF92ED9D7599F97CFC59
in the original…
-
Hi, I am trying to train GROBID to deal with German-language sociology of law scholarship. I have a collection of PDFs from four decades of journal issues. The older ones exist only as scanned images…
-
Related to #17.
I wrote a bash script a while ago (https://github.com/jlyon/ocr-anything) that would analyze the mimetype headers on the document and:
- For images, run tesseract
- For image pdfs, spl…
jlyon updated
8 years ago
-
So this is a cool project, but it involves going into costco's website to download their PDFs.
Is there any flexibility to take a scanned/ocr'd image of the one given to you in store and doing a s…
-
We can make this tool a bit more generic so it can also be used to rename PDFs.
In the EXIF output of PDF documents I can for example see the following data:
![image](https://user-images.githubuse…
-
Introduce a new library type called Magazines, aimed at magazines. The library behaves in the following ways:
- [x] Dedicated parser with a limited set of naming conventions
- [x] PDFs will open w…
-
### Descriptive summary
A large part of our collection is our student newspaper, which is primarily scanned PDFs. It would be really nice to use the Universal Viewer to display these newspaper iss…