Closed amritkrs closed 9 years ago
matej@mitmanek: ~$ html2text http://en.wikipedia.org/wiki/Monty_Python>/dev/null ; echo $?
0
matej@mitmanek: ~$
Get the updated code from https://github.com/Alir3z4/html2text or from my repository http://luther.ceplovi.cz/git/html2text.git. This repository is literally dead, because its author is.
@mcepl Thanks bro.
@mcepl inspite of using html2text from https://github.com/Alir3z4/html2text i still face the same problem. r = requests.get("http://en.wikipedia.org/wiki/Python_%28programming_language%29") print html2text.html2text(r.content)
Traceback (most recent call last):
File "
File a bug to @Alir3z4 then.
OK
r = requests.get('http://en.wikipedia.org/wiki/Monty_Python') print html2text.html2text(r.content) Traceback (most recent call last): File "", line 1, in
File "html2text.py", line 812, in html2text
return h.handle(html)
File "html2text.py", line 254, in handle
return self.optwrap(self.close())
File "html2text.py", line 266, in close
self.outtext = self.outtext.join(self.outtextlist)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 4: ordinal not in range(128)