jorahn / icy

data wrangling glue code
http://www.rcs-analytics.com/icy/index.html
MIT License
0 stars 0 forks source link

parsing html from zip #6

Closed jorahn closed 9 years ago

jorahn commented 9 years ago

Raised ValueError: "no text parsed from document". It seems like the ZipExtFile is read before being passed to read_html, resulting in an empty buffer b''.

Currently the only fix seems to be to set obj = obj.read() if type(obj) == zipfile.ZipExtFile.