zhangchn / wiki2touch-standalone-ui

Automatically exported from code.google.com/p/wiki2touch-standalone-ui
1 stars 1 forks source link

indexer strips '&#173' (soft hyphen) for hyphenation #17

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. tried to introduce hyphenation into wikidump using PyHyphen
2. indexer only runs if use '­'; for '­' seg faults
3. articles.bin has all '­' stripped by indexer, so no hyphenation, too bad

What is the expected output? What do you see instead?
Would like to build articles.bin with hyphenation, using PyHyphen on wikidump

What version of the product are you using? On what operating system?
indexer from 0.65a_rebuild, on Macos, 10.6

Please provide any additional information below.
It would be extremely nice to have wikipedia articles shown with hyphenation, 
especially on small 
screens like Iphone. In Iphone-Safari this works well using the '­' or 
'&#173,' character. But I 
found no way to create articles.bin including one of these characters.

Original issue reported on code.google.com by matthiaskasemann@gmail.com on 28 May 2010 at 9:21

GoogleCodeExporter commented 9 years ago
Hi,
Would you please try this build:
http://wiki2touch-standalone-ui.googlecode.com/files/indexer_test-build_20100601
.tar.bz2

Looking forward to your reply.

Original comment by zhangchnxp on 31 May 2010 at 4:57

GoogleCodeExporter commented 9 years ago
Please try this instead of the indexer mentioned in last post.

http://wiki2touch-standalone-ui.googlecode.com/files/indexer_tb20100609.tar.bz2

The previous build would crash due to something incompatible with 64-bit 
environment of Snow Leopard.

Original comment by zhangchnxp on 9 Jun 2010 at 3:58