totoleo / ik-analyzer

Automatically exported from code.google.com/p/ik-analyzer
0 stars 0 forks source link

IKAnalyzer的两个问题 #47

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
1. 
Dictionary.addWords,当添加的词库太大(我这里是将近30万)时��
�会出现bug,20万左右时没有这个问题
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
    at org.wltea.analyzer.dic.DictSegment.lookforSegment(DictSegment.java:228)
    at org.wltea.analyzer.dic.DictSegment.fillSegment(DictSegment.java:199)
    at org.wltea.analyzer.dic.DictSegment.fillSegment(DictSegment.java:204)
    at org.wltea.analyzer.dic.DictSegment.fillSegment(DictSegment.java:204)
    at org.wltea.analyzer.dic.DictSegment.fillSegment(DictSegment.java:170)
    at org.wltea.analyzer.dic.Dictionary.addWords(Dictionary.java:119)

2. 
当词库从20万跃至30万时,分词速度急剧下降,这是什么原因�
��

环境为:win7 + jdk1.6
version: IKAnalyzer2012_u3.zip 

What version of the product are you using? On what operating system?

Please provide any additional information below.

Original issue reported on code.google.com by jaysoona...@gmail.com on 27 Mar 2012 at 11:05

GoogleCodeExporter commented 9 years ago
请设置你的jvm内存参数而后再做测试。
你所说的性能下降是因为你的jvm内存不足,在频繁的执行GC

Original comment by linliang...@gmail.com on 28 Mar 2012 at 1:25