Open RayShark opened 2 years ago
I just wondering whether this tool support chinese corpus.
For example, do i suppose to use Jieba or other chinese tokenizer ? And is there interface reserved for chinese tokenizer...
Thanks a lot.
是的,只需要将文件读取改为utf-8的格式读取即可
I just wondering whether this tool support chinese corpus.
For example, do i suppose to use Jieba or other chinese tokenizer ? And is there interface reserved for chinese tokenizer...
Thanks a lot.