pdf2txt Search Results - Githubissues

360 results
for pdf2txt

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

euske/pdfminer #97

if 'W' in obj and 'H' in obj: TypeError: argument of type 'P…

I am running this command on a 25 Mo input PDF file : ``` pdf2txt.py -S -t xml -o pdfMinerOutput.xml input.pdf ``` It crashes with this stack trace : ``` Traceback (most recent call last): File "…

jlegaye updated 8 years ago
1
FlagOpen/FlagData #19

safetensors_rust.SafetensorError: Error while deserializing …

我删除了pytorch==2.0.1 修改codescikit-learn==1.3.0为 scikit-learn==1.3.0，要不然这两个下载会报错，但是这种请路况下虽然执行 pip install -r requirements.txt 成功，但是执行python pdf2txt.py -i "input_path" -o "output_file" 命令时报错：safetensors_r…

ChengRuiLiang updated 6 months ago
2
euske/pdfminer #252

pdf2txt error in command line cannot match the files argumen…

#164 This issue still remains unresolved on Win10 - python 3.7.1 Has someone found a solution yet?

udaykapur updated 4 years ago
10
euske/pdfminer #217

pdf2txt.py -t xml, get words/lines instead of chars

I'm using pdf2txt.py -t xml to dump the coordinates of each character of a pdf. Is there a way to get coordinates about words and lines (instead of individual characters)? I tried with -A and -M -L…

xdsv updated 6 years ago
2
euske/pdfminer #249

AttributeError: 'PDFObjRef' object has no attribute 'decode'…

I am using pdfminer's pdf2txt.py to extract text from different pdf's. The algorithm works very well in a lot of scenarios, but I am getting this error and I'm not sure what I can do to get pdfminer t…

swoltron updated 1 year ago
5
pdfminer/pdfminer.six #743

KeyError: 'JBIG2Globals'

- A description of the bug Trying to extract images from a one page pdf, I found a key Error. The file is readable by pdf viewer like Okular or Evince - Steps to reproduce the bug. The command I …

paucazou updated 2 years ago
2
gwk/pdfminer3 #4

How to use this package to convert PDF to TXT？

Platform：Win10，Python3.7.0； I tried use **pdf2txt.py samples/simple1.pdf** ,but it open a .py file and no result.

JupiterXue updated 4 years ago
1
euske/pdfminer #43

Document -F boxes_flow

pdf2txt prints that -F boxes_flow is an option but it is not documented on the web page or in the manual.

dfc updated 10 years ago
1
euske/pdfminer #288

I can not find the output file

when I type pdf2txt.py C:\gropid\input\Attention.pdf -o output C:\gropid\output\ I got no result I am missing something?

Mayar2009 updated 4 years ago
4
euske/pdfminer #17

Incorrect bounding boxes in xml output using forced analysis

Using the command: python pdf2txt.py -t xml -A produces a verifiable error in bounding boxes. (Please email me for the pdf)

jervispinto updated 12 years ago
2

上一页 1...1 2 3 4 5 6 7...36 下一页

360 results for pdf2txt

360 results
for pdf2txt