deanmalmgren / textract

extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.89k stars 599 forks source link

fix test cases and travis CI #389

Closed traverseda closed 3 years ago

traverseda commented 3 years ago

Messed up the PR process a bit while trying to resolve, but not too big of a problem

traverseda commented 3 years ago

Well the test cases work now (I accidentally committed the fixes to master on the repo not my fork) but it's failing on what looks like a pep-8 test.

traverseda commented 3 years ago

Odd, tests are working locally. I'll take a look at this another day.

Hopefully this should solve a bunch of dependency related issues as we're a lot more liberal in what dependency versions we accept.

helder-mattos commented 3 years ago

@traverseda I guess the problem can be fixed with a different approach to installing pip in test environment. https://github.com/helder-mattos/textract/commit/b68f58414366a78bde541799f6649abf6472007c

But still, the reverted pdfminer version has a missing shebang on the pdf2txt.py script file, which causes an error when it's executed.

traverseda commented 3 years ago

@helder-mattos thanks!