Closed sun7852961 closed 5 years ago
I0124 16:51:52.078809 6469 tokenizer.cpp:73] Vocabulary load successfully! #vocab_size = 294657 Enter Document1: 我是风 Enter Document2: 马和牛 Jensen-Shannon Divergence = 0.0 Hellinger Distance = 0.0
可以检查下输入的分词结果是怎样的,如果句子分词后所有词均不在词表中,那么是会出现如上情况
I0124 16:51:52.078809 6469 tokenizer.cpp:73] Vocabulary load successfully! #vocab_size = 294657 Enter Document1: 我是风 Enter Document2: 马和牛 Jensen-Shannon Divergence = 0.0 Hellinger Distance = 0.0