mushorg / glastopf

Web Application Honeypot
http://glastopf.org
550 stars 170 forks source link

Fix displaying of comments with unicode characters #239

Closed katkad closed 9 years ago

katkad commented 9 years ago

like: Detta är

Hi, fix for https://github.com/glastopf/glastopf/issues/238

works for this comment for example: Detta är sannolikt den mest informativ artikel om detta ämne jag har läst på sistone. Jag håller med och du är på den punkten. Grattis till ett väl utfört arbete.

but only for /comments part of glastopf

http://localhost/ won't work. It looks there is also other problem.

right now, it would display: Detta \xc3\x83\xc2\xa4r sannolikt den mest informativ artikel om detta \xc3\x83\xc2\xa4mne jag har l\xc3\x83\xc2\xa4st p\xc3\x83\xc2\xa5 sistone. Jag h\xc3\x83\xc2\xa5ller med och du \xc3\x83\xc2\xa4r p\xc3\x83\xc2\xa5 den punkten. Grattis till ett v\xc3\x83\xc2\xa4l utf\xc3\x83\xc2\xb6rt arbete.

I tried these things, but obiously I am missing something:

import BeautifulSoup from BeautifulSoup import * display_comments = BeautifulSoup(general_comments.encode('utf-8'), convertEntities=BeautifulSoup.HTML_ENTITIES)

display_comments = general_comments.decode('string_escape')

lala = general_comments.encode('unicode_escape')

lala = general_comments.decode('ascii', 'ignore') lala2 = str(lala.replace('\', '\')) print lala2 display_comments = lala2.decode('string_escape')

landscape-bot commented 9 years ago

Code Health Code quality remained the same when pulling 4685e96 on CZ-NIC:comments_encoding into 0b1c062 on glastopf:master.

glaslos commented 9 years ago

LGTM