infinilabs / analysis-pinyin

🛵 This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.
Apache License 2.0
2.94k stars 547 forks source link

中文咖啡的咖分词后的拼音错误 #211

Open chen0401 opened 5 years ago

chen0401 commented 5 years ago

咖啡的咖怎么拼音是ga呀?

GET pinyintest/_analyze { "analyzer": "pinyinIndexAnalyzer", "text": "咖啡店" }

{ "tokens" : [ { "token" : "ga", "start_offset" : 0, "end_offset" : 3, "type" : "CN_WORD", "position" : 0 }, { "token" : "gafeidian", "start_offset" : 0, "end_offset" : 3, "type" : "CN_WORD", "position" : 0 }, { "token" : "gfd", "start_offset" : 0, "end_offset" : 3, "type" : "CN_WORD", "position" : 0 }, { "token" : "fei", "start_offset" : 0, "end_offset" : 3, "type" : "CN_WORD", "position" : 1 }, { "token" : "dian", "start_offset" : 0, "end_offset" : 3, "type" : "CN_WORD", "position" : 2 } ] }