Does this tool support Chinese?

SimGus / Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

MIT License


Open RayShark opened 2 years ago

RayShark commented 2 years ago

I'm just wondering whether this tool supports Chinese corpora.

For example, am I supposed to use Jieba or another Chinese tokenizer? And is there an interface reserved for a Chinese tokenizer?

Thanks a lot.

gaojunyu21 commented 2 months ago

Yes. You only need to change the file reading so that files are read as UTF-8.
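As a rough illustration of the suggestion above, here is a minimal sketch of reading a template file with an explicit UTF-8 encoding in Python. It does not show Chatette's own code: the helper `read_template_utf8`, the file name `template.chatette`, and the sample content are all made up for this example, and where exactly the read would need to change inside Chatette is not stated in this thread.

```python
import os
import tempfile


def read_template_utf8(path):
    """Read a text file, decoding it explicitly as UTF-8 (hypothetical helper)."""
    # Without encoding="utf-8", open() uses the platform's default encoding,
    # which on some systems is not UTF-8 and can garble Chinese text
    # or raise UnicodeDecodeError.
    with open(path, encoding="utf-8") as f:
        return f.read()


# Illustrative template content containing Chinese text (an assumption, not from the thread).
sample = "%[问候]\n    你好\n    早上好\n"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "template.chatette")
    # Write the sample as UTF-8 so the round trip is well defined.
    with open(path, "w", encoding="utf-8") as f:
        f.write(sample)
    text = read_template_utf8(path)
```

If the file is read this way, the Chinese characters should come back unchanged regardless of the system's default encoding.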