Hi, first I'd like to thank Taku Kudo for his awesome MeCab.
I'm training MeCab from scratch to make it analyse Chinese sentences, following
this website: http://www.onaneet.org/blog/archives/4020, but I'm having some
trouble doing it.
First, I prepared the files
- dicrc
- char.def
- unk.def
- rewrite.def
- feature.def
as explained on onaneet.
Then I prepared a training corpus for Chinese and ran mecab-dict-index.
Everything worked fine here.
But when running mecab-cost-train, if the training corpus has more than around
700 sentences, the program stops without any error on stderr.
700 sentences is rather small for a training corpus, isn't it?
And this looks like an unexpected bug...
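To help pin down the exact threshold, here is a minimal sketch of a helper that truncates a corpus to its first N sentences, so mecab-cost-train can be rerun on progressively larger inputs. It assumes the corpus is in the usual MeCab format where each sentence ends with a line containing only EOS. The function name first_n_sentences and the file names in the usage comment are hypothetical, not part of MeCab.

```python
def first_n_sentences(lines, n):
    """Return the lines making up the first n sentences of a corpus.

    Assumption: each sentence is terminated by a line containing only "EOS".
    """
    out = []
    count = 0
    for line in lines:
        if count >= n:
            break
        out.append(line)
        # A bare "EOS" line closes the current sentence.
        if line.rstrip("\r\n") == "EOS":
            count += 1
    return out


# Hypothetical usage (file names are placeholders):
# with open("corpus.txt", encoding="utf-8") as src:
#     subset = first_n_sentences(src, 700)
# with open("corpus_700.txt", "w", encoding="utf-8") as dst:
#     dst.writelines(subset)
```

Running the training on subsets of, say, 600, 700 and 800 sentences should show whether the stop depends on the sentence count itself or on one particular sentence.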
I used the Windows version, mecab-0.996.exe, on Windows Server 2008 R2 Standard
(x64).
Original issue reported on code.google.com by lacam...@sinequa.com on 18 Jul 2013 at 7:53