pdfreader Search Results

1000+ results for pdfreader

py-pdf/pypdf #2290

Text extraction throws IndexError on some PDFs

Recently I ran into a particular kind of pdf file from which I cannot extract text because the library throws an exception. ## Environment Which environment were you using when you encountered t…

sescobar99 updated 11 months ago
2
py-pdf/pypdf #1813

How to skip Tables and Images when parse PDF?

When i parse PDF files, i want to skip Tables and Images in PDF, because they may disrupt paragraph structure ## Environment ``` $ python -m platform Linux-5.15.0-69-generic-x86_64-with-debian-b…

Ontheroad123 updated 1 year ago
4
py-pdf/pypdf #2407

Missing pytest.mark.samples

Trying to run the tests when samples are not available ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Linux-5.10.0-27-amd64-x…

kitterma updated 10 months ago
3
empira/PDFsharp #54

Encryption seems to break images that use indexed color

I have a PDF with an indexed color image of the company logo on every page. See attached for a similar document using an indexed version of the PDFsharp logo and which reproduces the issue ([indexed-c…

chrishaug updated 10 months ago
2
py-pdf/pypdf #2299

Unable to process pdf converted to bytes

I have just installed the package and tried uploading a file using form data, passing the file to PdfReader gives an error, > path should be string, bytes, os.PathLike or integer, not FileStorage"…

BrianMwas updated 12 months ago
4
run-llama/llama_index #8835

[Question]: How to Query RAG systems with Multiple prompt c…

### Question Validation - [X] I have searched both the documentation and discord for an answer. ### Question I want to query the RAG system using multiple questions/queries concurrently. I am us…

okoliechykwuka updated 1 year ago
2
py-pdf/pypdf #2268

Error when filling a value with parentheses

Using parenthesis causes "content stream" to be displayed ## Environment ```bash $ python -m platform Linux-6.2.0-34-generic-x86_64-with-glibc2.35 $ python -c "import pypdf;print(pypdf._d…

KanorUbu updated 1 year ago
4
LibrePDF/OpenPDF #763

Certain documents with shared objects/streams gets the modif…

**Describe the bug** For certain documents putting a visible signature in a page which has a /Contents with an array where one or more of the objects within it has the same number as an object in an …

netmackan updated 1 year ago
2
deepset-ai/haystack #5963

For Pdf file add title and subject in meta data

Hello, to have a more accurate retriever, i need to add some information in meta data (in my case title of document and subject). to do that i propose to add the method : ``` # Add title and…

warichet updated 11 months ago
4
py-pdf/pypdf #2368

UnicodeEncodeError: 'charmap' codec can't encode character …

I am new to Python and I am developing a program that takes a PDF file as input and converts it into text, I am using Python3 and tried both (PyPDF2 and PDFMiner.six) packages. for first pdf file it …

sunaarun updated 10 months ago
1
