yanyiwu / cppjieba

"结巴"中文分词的C++版本
MIT License
2.6k stars 691 forks source link

结果任然是一个字一个字,例如:我爱中国,结果出来为 我/爱中/国 #130

Closed 18360939479 closed 1 month ago

18360939479 commented 5 years ago

我运行的环境是vs2010,前面有个兄弟他是2017编译的会存在中文编码格式不符合问题,但是vs2010中文编码格式就是utf-8,求教

18360939479 commented 5 years ago

已解决。2010默认编码方式是GB2312,文本需要转换为UTF_8格式后可进行分词操作。如需控制台显示结果的需要将UTF-8格式再转换到普通string或char类型。方可显示

github-actions[bot] commented 1 month ago

This issue has not been updated for over 5 years and will be marked as stale. If the issue still exists, please comment or update the issue, otherwise it will be closed after 7 days.

github-actions[bot] commented 1 month ago

This issue has been automatically closed due to inactivity. If the issue still exists, please reopen it.