zinking / bgweb

Automatically exported from code.google.com/p/bgweb

GBK encoding is still a problem. #2

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Parse a gb2312-encoded page.
2. Render the page using the corresponding GBK encoding.
3. This generally works, but it still fails on some special characters.

What is the expected output? What do you see instead?
Expected: no more errors on the detail page.
Actual: currently about 5.2% of requests end in an error.

Please use labels and text to provide additional information.
I have seen a GBK character filter package in the mico-blog project.
Use this filter on all incoming content and convert every encoding to UTF-8.

Original issue reported on code.google.com by zinking3 on 14 Oct 2010 at 1:54
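
A minimal sketch of the "convert everything to UTF-8" idea, written in Python 3 (the project itself ran on Python 2). The helper name `to_utf8` and its `declared` parameter are hypothetical and not taken from bgweb or the mico-blog filter. It assumes that pages labelled gb2312 may contain GBK-only characters, so it decodes with `gb18030`, a superset of both.

```python
def to_utf8(raw: bytes, declared: str = "gb2312") -> bytes:
    """Decode page bytes and re-encode them as UTF-8 (hypothetical helper)."""
    # Assumption: a page declared as gb2312 or gbk may contain characters
    # outside that set, so decode with the gb18030 superset instead.
    codec = "gb18030" if declared.lower() in ("gb2312", "gbk") else declared
    # errors="replace" substitutes U+FFFD for undecodable bytes instead of raising.
    text = raw.decode(codec, errors="replace")
    return text.encode("utf-8")
```

Whether this is enough depends on the source pages; bytes that are invalid in every GB encoding would still come out as replacement characters.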

GoogleCodeExporter commented 9 years ago
A stack trace is needed to analyze the details of the problem.
Contrary to what was originally thought, applying Q2B decoding techniques is not the answer; that is not the problem.
The result page has to be GBK-encoded to suit some BBS servers.

Original comment by zinking3 on 18 Oct 2010 at 8:43

GoogleCodeExporter commented 9 years ago
Some error log entries:
<type 'exceptions.UnicodeEncodeError'>: 'gbk' codec can't encode character u'\xa0' in position 7359: illegal multibyte sequence
<type 'exceptions.UnicodeEncodeError'>: 'gbk' codec can't encode character u'\xa0' in position 7353: illegal multibyte sequence
Request was aborted after waiting too long to attempt to service your request. 
This may happen sporadically when the App Engine serving cluster is under 
unexpectedly high or uneven load. If you see this message frequently, please 
contact the App Engine team.

Original comment by zinking3 on 19 Oct 2010 at 5:22
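
The character in the log, U+00A0 (NO-BREAK SPACE, often produced by `&nbsp;` in HTML), has no mapping in Python's `gbk` codec, which is why the encode step raises. One possible workaround is sketched below in Python 3, assuming the output is HTML so that numeric character references are acceptable. The function name `encode_for_gbk` is hypothetical, and this fix was not confirmed in the thread.

```python
def encode_for_gbk(html: str) -> bytes:
    """Encode an HTML string as GBK without raising (hypothetical helper)."""
    # xmlcharrefreplace turns every character GBK cannot represent into a
    # numeric reference such as &#160;, which browsers render as the
    # original character.
    return html.encode("gbk", errors="xmlcharrefreplace")
```

For non-HTML output, replacing U+00A0 with a plain space before encoding (`text.replace(u"\xa0", u" ")`) would be a simpler alternative, at the cost of losing the non-breaking behaviour.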