halleck1 / bzreader

Automatically exported from code.google.com/p/bzreader

Cannot Index Chinese Wikidump #20

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
I have tried several dumps of the Chinese Wikipedia, and it always stops while indexing.
My OS is Windows 7.

Original issue reported on code.google.com by Blis...@gmail.com on 25 Mar 2010 at 10:46

GoogleCodeExporter commented 9 years ago
At what stage does it stop? How long is the indexing estimated to take, and how long does it take before it stops? Does the program hang (i.e. become unresponsive)?

Original comment by asaf.bartov on 7 Apr 2010 at 6:43

GoogleCodeExporter commented 9 years ago
It stopped at the indexing stage. I found that neither the memory usage nor the index output of the process increased. I waited 12 hours, but the process showed no change. It never exits on its own, so I have to terminate it manually every time. It seems to be a problem with Chinese word segmentation. I tried replacing the Chinese stemming module with the ICTCLAS Lucene.Net version, but that raised another bug that I couldn't solve.
FYI: http://blog.csdn.net/lgnlgn/archive/2009/07/24/4377116.aspx

Original comment by Blis...@gmail.com on 8 Apr 2010 at 4:49