takuyaa / kuromoji.js

JavaScript implementation of Japanese morphological analyzer
832 stars 117 forks source link

微笑み is broken down to 微 and 笑み #36

Open louy2 opened 5 years ago

louy2 commented 5 years ago

I am trying to use kuroshiro with kuromoji to annotate Japanese lyrics with romaji. In that context, "微笑み" should be kept together to be convert to "hohoemi", but because it is broken down to "微" and "笑み", the romaji conversion proceeds to output "bi emi".

cyfung1031 commented 1 year ago

This is not kuromoji issue. You can see it is correct in https://takuyaa.github.io/kuromoji.js/demo/tokenize.html