deanmalmgren / textract

extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.89k stars 599 forks source link

Passing additional arguments to underlying library (e.g. antiword) #288

Open marcelo-dalmeida opened 5 years ago

marcelo-dalmeida commented 5 years ago

Hi

I would like to pass some flags to antiword command through doc_parser

In my specific case, I would like to pass the width argument so text inside cells are not broken in different lines (-w 0) (for text processing purposes)

e.g. without '-w 0' argument

image

with '-w 0' argument

image