petermr / pyami

Semantic Reader of the Scientific Literature
Apache License 2.0
12 stars 9 forks source link

`pdf2html` does not obey `--maxpage` #6

Open petermr opened 1 year ago

petermr commented 1 year ago

py4ami 0.0.40 does not stop after maximum pages:

pm286macbook:junk pm286$ py4ami PDF --pdf2html pdfplumber --inpath /Users/pm286/projects/semanticClimate/ipcc/ar6/wg3/Chapter02/fulltext.pdf  --outdir /Users/pm286/misc --maxpage 10
WARNING:root: *** ARG_DICT {'version': False, 'command': 'PDF', 'flow': True, 'footer': 80, 'header': 80, 'inpath': '/Users/pm286/projects/semanticClimate/ipcc/ar6/wg3/Chapter02/fulltext.pdf', 'maxpage': '10', 'outdir': '/Users/pm286/misc', 'outstem': 'fulltext.flow', 'outform': 'html', 'pdf2html': 'pdfplumber', 'pages': 'all_pages', 'resolution': 400}
WARNING:root: *** ARG_DICT {'version': False, 'command': 'PDF', 'flow': True, 'footer': 80, 'header': 80, 'inpath': '/Users/pm286/projects/semanticClimate/ipcc/ar6/wg3/Chapter02/fulltext.pdf', 'maxpage': '10', 'outdir': '/Users/pm286/misc', 'outstem': 'fulltext', 'outform': 'html', 'pdf2html': 'pdfplumber', 'pages': 'all_pages', 'resolution': 400, 'outpath': PosixPath('/Users/pm286/projects/semanticClimate/ipcc/ar6/wg3/Chapter02/fulltext.html')}
 wrote html /Users/pm286/misc/fulltext.flow_0.html
 wrote html /Users/pm286/misc/fulltext.flow_1.html
 wrote html /Users/pm286/misc/fulltext.flow_2.html
 wrote html /Users/pm286/misc/fulltext.flow_3.html
 wrote html /Users/pm286/misc/fulltext.flow_4.html
 wrote html /Users/pm286/misc/fulltext.flow_5.html
 wrote html /Users/pm286/misc/fulltext.flow_6.html
 wrote html /Users/pm286/misc/fulltext.flow_7.html
 wrote html /Users/pm286/misc/fulltext.flow_8.html
 wrote html /Users/pm286/misc/fulltext.flow_9.html
 wrote html /Users/pm286/misc/fulltext.flow_10.html
 wrote html /Users/pm286/misc/fulltext.flow_11.html
 wrote html /Users/pm286/misc/fulltext.flow_12.html
 wrote html /Users/pm286/misc/fulltext.flow_13.html