akira-kurogane / furigana-injector

Automatically exported from code.google.com/p/furigana-injector

Paragraphs truncated on "&" characters #33

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Inject furigana into a paragraph that contains an "&" character.

What is the expected output? What do you see instead?
The paragraph should be shown in full. Instead, the text from the "&" onward is truncated.

Original issue reported on code.google.com by akira%ya...@gtempaccount.com on 8 May 2010 at 11:49

GoogleCodeExporter commented 9 years ago
Caused by not identifying MeCab nodes of 'unknown' type. Fixed by passing 
those nodes through in the same way a non-kanji node is passed.
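
A minimal sketch of the pass-through idea described above. This is not the project's code: the `Node` class, its `unknown` flag, and the `render` function are hypothetical stand-ins for whatever the real MeCab node handling looks like, and the ruby markup is only an assumed output format.

```python
import html
import re
from dataclasses import dataclass

# Rough test for "contains kanji" (CJK Unified Ideographs block only).
KANJI = re.compile(r"[\u4e00-\u9fff]")


@dataclass
class Node:
    """Hypothetical stand-in for one parser node."""
    surface: str           # the text as it appears in the paragraph
    reading: str = ""      # kana reading, if the parser produced one
    unknown: bool = False  # True when the parser marked the node as 'unknown'


def render(nodes):
    """Emit every node; only known kanji nodes get furigana markup."""
    out = []
    for node in nodes:
        if node.unknown or not KANJI.search(node.surface):
            # Unknown nodes are passed through like non-kanji nodes,
            # so characters such as "&" are kept rather than lost.
            out.append(html.escape(node.surface))
        else:
            out.append(
                f"<ruby>{html.escape(node.surface)}"
                f"<rt>{html.escape(node.reading)}</rt></ruby>"
            )
    return "".join(out)


sample = [Node("曲", "きょく"), Node("&", unknown=True), Node("歌", "うた")]
rendered = render(sample)
```

The point of the sketch is only that no node type is skipped: every node contributes its surface text to the output, with or without a reading attached.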

Original comment by akira%ya...@gtempaccount.com on 8 Jun 2010 at 1:20

GoogleCodeExporter commented 9 years ago
Found that the string "『日本国憲法前文』曲&歌" (occurs near the bottom of 
http://ja.wikipedia.org/wiki/%E6%97%A5%E6%9C%AC%E5%9B%BD%E6%86%B2%E6%B3%95%E5%89%8D%E6%96%87) 
still truncates.

Original comment by akira%ya...@gtempaccount.com on 10 Jun 2010 at 5:28

GoogleCodeExporter commented 9 years ago
Is this actually a server-side bug, and was it supposed to be fixed in the 
current version of the extension?
I tried some sites and found that many still have this problem, while some 
are fixed.
Thanks.

Original comment by innatere...@gmail.com on 10 Jun 2010 at 12:30

GoogleCodeExporter commented 9 years ago
It is a server-side bug: bad parsing of the "&" character in form-post input.

To fix: either rewrite the form parsing, or send the input in a different format.
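
To illustrate the class of bug, here is a small sketch using Python's standard library. It is not the server's actual code, and the field name `para` is made up for the example. It shows how a form body containing an unescaped "&" is cut at that character, and how percent-encoding the value avoids the problem.

```python
from urllib.parse import parse_qs, urlencode

text = "『日本国憲法前文』曲&歌"

# Hypothetical request body built by hand, with the value left unescaped.
raw_body = "para=" + text

# A form parser treats the bare "&" as a field separator,
# so everything from the "&" onward is no longer part of "para".
naive_fields = parse_qs(raw_body, keep_blank_values=True)
truncated = naive_fields["para"][0]

# Percent-encoding the value turns "&" into "%26", which survives parsing.
encoded_body = urlencode({"para": text})
round_tripped = parse_qs(encoded_body)["para"][0]

print(truncated)       # 『日本国憲法前文』曲
print(round_tripped)   # 『日本国憲法前文』曲&歌
```

Either side can close the gap: the sender can escape the value before posting, or the receiver can parse the body in a way that does not split on a literal "&" inside the text.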

Original comment by akira%ya...@gtempaccount.com on 16 Jun 2010 at 2:58

GoogleCodeExporter commented 9 years ago
Fixed in form parsing. Consider a format change in the future. That will 
require a server that can parse the input in both formats, for a while at least.
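
A rough sketch of what parsing the input in two ways could look like during a transition. The thread does not say what the new format would be; JSON is assumed here purely for illustration, and `extract_text` and the `para` field are hypothetical names.

```python
import json
from urllib.parse import parse_qs, urlencode


def extract_text(content_type: str, body: bytes) -> str:
    """Return the posted paragraph from either supported request format."""
    # Ignore parameters such as "; charset=utf-8".
    media_type = content_type.split(";", 1)[0].strip().lower()
    decoded = body.decode("utf-8")
    if media_type == "application/json":
        # Assumed new format: a JSON object with a "para" member.
        return json.loads(decoded)["para"]
    if media_type == "application/x-www-form-urlencoded":
        # Existing format: a percent-encoded form post.
        return parse_qs(decoded, keep_blank_values=True)["para"][0]
    raise ValueError(f"unsupported content type: {content_type!r}")


old_style = extract_text(
    "application/x-www-form-urlencoded",
    urlencode({"para": "曲&歌"}).encode("utf-8"),
)
new_style = extract_text(
    "application/json; charset=utf-8",
    json.dumps({"para": "曲&歌"}).encode("utf-8"),
)
```

Dispatching on the `Content-Type` header lets older clients keep posting forms while newer ones switch over, and the form branch can be dropped once the old clients are gone.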

Original comment by akira%ya...@gtempaccount.com on 16 Jun 2010 at 4:01