pdf2text Search Results

shahrukhx01/multilingual-pdf2text #5

PDF Reading Got no output

INFO:multilingual_pdf2text.doc2img.parse_document:Parsing document from pdf to image INFO:multilingual_pdf2text.doc2img.parse_document:Unable to get page count. Is poppler installed and in PATH? INF…

UmerMehmood-Appsqueeze updated 3 months ago

clowder-framework/CONSORT-frontend #86

In one example, the title label was identified by the model but no page information was present in the PreviewDrawerLeft component "Page" button. Check if the predicted_csv output from the model has t…

minump updated 1 month ago

wri-dssg-omdena/policy-data-analyzer #63

Optimize PDF2Text Pipeline

ramanshgrover updated 3 years ago

dvdblk/hack4good-oecd #8

Implement translation in pdf2text

* Implement the module pdf2text/translate.py * The module should take a from_language and provide a public method `translate` which works on the document level (input and output are the entire documen…

V-G-spec updated 8 months ago

cyfreak/text-compare #2

automated PDF2text conversion

cyfreak updated 10 years ago

ocropus/hocr-tools #186

corrupted data when generating a searchable pdf with hocr-pd…

I am trying to generate a searchable pdf from a jpeg file and a hocr file with the help of hocr-pdf. I have both files in the same folder. `hocr-pdf . > out.pdf` generates a pdf but I cannot search…

pprw updated 1 week ago

saubhagya/pdf2text #1

Fatal Error in pdf.pdf2text.inc

This is the error i get: Fatal error: Call to a member function getDictionary() on a non-object in /home/s002003/public_html/PDF/saubhagya/pdf.pdf2text.inc on line 109 In pdf2text.php i have try to s…

Exadra37 updated 11 years ago

deanmalmgren/textract #424

Pdfminer on Windows searches for pdf2text.py.exe

When extracting a PDF using the pdfminer method, it looks for an application called `pdf2text.py`, but the spawn package adds `.exe` to it automatically. Obviously this file doesn't exists, so it thro…

PeterTillema updated 2 years ago

thp/urlwatch #704

Pdf2TextFilter error handling

I wanted to use urlwatch like this: ```yaml --- url: https://donneespubliques.meteofrance.fr/donnees_libres/bulletins/BCM/202205.pdf filter: - pdf2text --- url: https://donneespubliques.met…

JulienPalard updated 2 years ago

euske/pdfminer #269

html or xml converter fails with TypeError: write() argument…

python pdf2txt.py -t xml -o output.xml -d %pdffilepath% fails with following error ``` Traceback (most recent call last): File "pdf2text.py", line 113, in if __name__ == '__main__': sys.exi…

Prasaddiwalkar updated 3 months ago

190 results
for pdf2text