SimGus / Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito
MIT License
315 stars, 55 forks

Does this tool support Chinese? #54

Open RayShark opened 2 years ago

RayShark commented 2 years ago

I am just wondering whether this tool supports Chinese corpora.

For example, am I supposed to use Jieba or another Chinese tokenizer? And is there an interface reserved for a Chinese tokenizer...

Thanks a lot.

gaojunyu21 commented 2 months ago

Yes, you only need to change the file reading so that files are read as UTF-8.
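
To illustrate the idea of reading files as UTF-8, here is a minimal Python sketch. It is not taken from Chatette's source: the helper `read_template_lines`, the file name `template.chatette`, and the sample content are all hypothetical and only show how passing an explicit `encoding="utf-8"` keeps Chinese text intact regardless of the platform's default encoding.

```python
import os
import tempfile


def read_template_lines(path):
    """Read a text file as UTF-8 and return its lines without newlines.

    Hypothetical helper for illustration; not part of Chatette's API.
    """
    # Passing the encoding explicitly avoids relying on the platform
    # default, which may not be UTF-8 on some systems.
    with open(path, encoding="utf-8") as f:
        return f.read().splitlines()


# Illustrative sample content containing Chinese text (an assumption,
# not a file from the project).
sample = "%[问候]\n    你好\n    早上好\n"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "template.chatette")
    # Write the sample as UTF-8 so the round trip is well defined.
    with open(path, "w", encoding="utf-8") as f:
        f.write(sample)
    lines = read_template_lines(path)

for line in lines:
    print(line)
```

Where exactly the file is opened inside the tool is not stated in this thread, so the change would need to be applied wherever the template files are actually read.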