-
Hi. GoBooDo is great, but if it create **searchable PDF**, it will get more great. I wrote the codo to do this.
# Description
This patch add `ocrPDF` method to `createBook` class. This method ap…
-
### Describe the issue
On debian, xml2rfc's self-tests are failing, as can be seen for example at: https://ci.debian.net/data/autopkgtest/testing/amd64/x/xml2rfc/30269516/log.gz
```
....F........…
-
-
PdfFileReader 变成了PdfReader
Get_page() 变成了pages[ ]
extractText变成了extract_text
-
Followed your step by creating environment, installed requirement.txt and created `.env`(without a file name, just the extension).
Ran the command `streamlit run app.py`
Scenario 1 (Some files…
-
Would it be possible to put a wheel on PyPi for this project? This was recommended to me here:
https://github.com/pyodide/pyodide/discussions/2056
I'd like to try to use PyPDF4 in a project that a…
-
Starter code is present in [here](https://github.com/SriPrarabdha/LegalBrain-VectorSearch/blob/main/scripts/DownloadPDF_Selenium.py)
1. Run a loop over this code and download pdf of all judgments
…
-
URL: https://www.il-fa.com/
Documents URL: https://www.il-fa.com/public-access/board-documents/
Spider Name: il_finance_authority
Agency Name: Illinois Finance Authority
See the [contribution gu…
-
Just found this fork/project after logging https://github.com/mstamy2/PyPDF3/issues/13 test case below is for PyPDF4.
I've seen a number of PDF files where the `title` attribute/property is reporte…
-
I am having a ligature issue with this PDF.
'fi', 'fl' and 'ff' characters are returning NULL
#598 is similar to this issue.
## MVCE: Code + PDF
```python
from PyPDF2 import PdfReader
r…