deanmalmgren / textract

extract text from any document. no muss. no fuss.
http://textract.readthedocs.io
MIT License
3.89k stars 599 forks source link

Sphinx supports python 3.5+ #296

Closed jpweytjens closed 5 years ago

jpweytjens commented 5 years ago

This update fixes the msg parser, updates the Travis CI build, fixes some tests and updates a badge on the readme.rst.

msg parser

The msg parser is updated to use the new syntax of extract-msg.

Travis CI

Travis now used Ubuntu Bionic which comes with python 3.7, as sphinx only supports python 3.5+. Changes have been to dependencies if the latest version no longer supports python 2.7. Finally, the deprecated pep8 package is replaced with pycodestyle.

Tests

Added pytest to the test suite. Fixed the audio, pdf and image file tests.