pypdf2 Search Results - Githubissues

1000+ results for pypdf2


learningequality/ricecooker #312

PDF Files created with wkhtmltopdf cannot have their outline…

### Description Because of an extant bug in PyPDF2 https://github.com/mstamy2/PyPDF2/issues/193 trying to read the outline for a file generated in wkhtmltopdf results in an error. This means that l…

rtibbles updated 2 years ago
2
mlabonne/llm-datasets #3

How to create an instruction dataset from .pdf and .docx doc…

Hello I'm in the process of fine-tuning a Large Language Model (LLM) for an NGO and I need to construct an instruction dataset from .pdf and .docx documents containing information in text. The obje…

Vonewman updated 2 months ago
1
cycomanic/Menextract2pdf #23

Command not found

Hello, Thank you for developing this code, I know it will be invaluable once I can get it working. I'm working on a Mac and trying to execute the "menextract2pdf_overwrite.sh" command after navigat…

dfournier013 updated 3 years ago
3
alejandro-ao/langchain-ask-pdf #15

requirements.txt modification needed

Thank you Alejandro! I got chatbot to work on my Windows 11 PC with the following requirement.txt langchain==0.0.166 PyPDF2==3.0.1 python-dotenv==1.0.0 streamlit==1.18.1 faiss-cpu==1.7.4 alta…

ajavamind updated 1 year ago
3
morngrar/pdfbooktool #2

Refactor code.

I can see no reason for outputting intermediate files with this script. Code should be refactored into outputting straight to out.pdf.

morngrar updated 2 years ago
1
cgs/evernote #20

Syntax error in "evernote/setup.py", line 6

One of `pypdfocr` pre-requisites is evernote. When running `pip3 install pypdfocr` I get the following exception: ``` $ pip3 install pypdfocr ... Collecting evernote (from pypdfocr) Using cached ev…

ronbarak updated 6 years ago
3
adriacabeza/erudito #4

Great Project - Can't read pdfs though - PdfFileReader is de…

Excellent project, thanks for making this for us folks who struggle to put these things together. When I try to get it to read my files, I get error - PdfFileReader is deprecated and was removed i…

adiso06 updated 1 year ago
1
astefanutti/decktape #69

Embed presentation title in PDF

Is it possible to extract the title of a presentation and embed it in the exported PDF? I've tested a few of the generated PDFS using PyPDF2, and none of them have title metadata. I guess this may be …

abingham updated 7 months ago
3
tylerdq/pdfca #7

Improve performance

At least with .parquet, [there are opportunities](https://wesmckinney.com/blog/python-parquet-multithreading/) to improve speed and reduce disk usage with dataframe binaries via pyarrow's built-in thr…

tylerdq updated 5 years ago
1
py-pdf/pypdf #2443

pypdf creates invalid links with add_annotation since PyPDF2…

Many years ago I used pypdf to create links for a book of maps for our storm sewer system. I had an index page that had links to all of the other pages and each page had links to the page with the map…

rsinger417 updated 9 months ago
1

