lancopku / pkuseg-python

The pkuseg toolkit for multi-domain Chinese word segmentation
MIT License
6.55k stars 986 forks

Incorrect part-of-speech output #174

Open Lumingous opened 1 year ago

Lumingous commented 1 year ago

I tested with the example provided:

    import pkuseg

    seg = pkuseg.pkuseg(postag=True)  # enable part-of-speech tagging
    text = seg.cut('我爱北京天安门')  # run segmentation and part-of-speech tagging
    print(text)

The output is: [('我', 'B'), ('爱', 'I_end'), ('北京', 'B'), ('天安门', 'I_end')]

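I tried several examples, and in every case the returned tags are entries from tagIndex.txt rather than the actual part-of-speech tags in tags.txt.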
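To make the symptom easy to check on other inputs, here is a minimal sketch that flags output whose tags are all segmentation labels instead of part-of-speech tags. The helper name `tags_look_like_seg_labels` and the label set are assumptions for illustration only: the set contains just the two labels seen in the output above, since the full contents of tagIndex.txt are not shown in this issue.

```python
# Labels observed in the reported output. This set is an assumption and may be
# incomplete, because the full contents of tagIndex.txt are not shown here.
OBSERVED_SEG_LABELS = {"B", "I_end"}


def tags_look_like_seg_labels(pairs, seg_labels=OBSERVED_SEG_LABELS):
    """Return True if the list is non-empty and every tag is a segmentation label.

    `pairs` is a list of (word, tag) tuples, the shape shown in the report.
    """
    return bool(pairs) and all(tag in seg_labels for _, tag in pairs)


# Output copied verbatim from the report.
reported = [('我', 'B'), ('爱', 'I_end'), ('北京', 'B'), ('天安门', 'I_end')]

print(tags_look_like_seg_labels(reported))  # prints True
```

Passing the result of `seg.cut(...)` from the snippet in the report to this helper would show whether a given installation is affected; a `True` result means no real part-of-speech tag was returned for any word.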