Improvements on the way the string is tokenized.

JianLoong / word-cloud-generator

D3 Word Cloud Generator

https://jian.sh/word-cloud-generator

MIT License

4 stars 9 forks source link

Closed JianLoong closed 1 year ago

JianLoong commented 1 year ago

Currently, the way the string input is tokenized is just using a simple split method.

Perhaps using a better tokenize library would give better results. For example here

Prashriya commented 1 year ago

Can I work on this?