Chinese text support in text preprocessing

keras-team / keras-preprocessing

Utilities for working with image data, text data, and sequence data.

Other

1.02k stars 444 forks source link

Open xieliaing opened 5 years ago

xieliaing commented 5 years ago

Looks like there is no built-in support in Tokenizer for Chinese text parsing. It can be built using Jieba package, just need some coding work.

xieliaing commented 5 years ago

OK, I will come up with some code to close this.