JianLoong / word-cloud-generator

D3 Word Cloud Generator
https://jian.sh/word-cloud-generator
MIT License
4 stars 9 forks source link

Improvements on the way the string is tokenized. #12

Closed JianLoong closed 1 year ago

JianLoong commented 1 year ago

Currently, the way the string input is tokenized is just using a simple split method.

Perhaps using a better tokenize library would give better results. For example here

Prashriya commented 1 year ago

Can I work on this?