MichaelAquilina / Reddit-Recommender-Bot

Identifying Interesting Documents for Reddit using Recommender Techniques

Consider sending pull request for Unicode fix to Goose #91

Closed MichaelAquilina closed 10 years ago

MichaelAquilina commented 10 years ago

goose/parsers.py lines 51-59

@classmethod
def fromstring(self, html):
    html = encodeValue(html)

    if type(html) == unicode:
        self.doc = lxml.html.fromstring(html.encode('utf8'))
    else:
        self.doc = lxml.html.fromstring(html)
    return self.doc
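
For reference, the branch in the snippet above boils down to "if the input is text, encode it as UTF-8 before handing it to the parser; if it is already bytes, pass it through". A minimal sketch of just that normalization step, kept separate from Goose and lxml. `to_utf8_bytes` is a hypothetical helper name used only for illustration, and this is not necessarily the fix the pull request would contain:

```python
def to_utf8_bytes(html):
    """Return `html` as UTF-8 encoded bytes.

    Hypothetical helper for illustration only; it is not part of Goose.
    """
    if isinstance(html, bytes):
        # Already a byte string: pass it through untouched.
        return html
    # Text (unicode in Python 2, str in Python 3): encode before parsing.
    return html.encode('utf-8')


# Text input is encoded; byte input is returned unchanged.
encoded = to_utf8_bytes(u'<p>caf\xe9</p>')
unchanged = to_utf8_bytes(b'<p>plain</p>')
```

The result could then be passed to `lxml.html.fromstring` in place of the two-branch `if`/`else`, assuming the parser is happy with UTF-8 bytes as the quoted code suggests.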