zenoxygen / bayespam

A simple bayesian spam classifier written in Rust.
https://crates.io/crates/bayespam
MIT License

Doesn't work with Unicode #2

dmaahs2017 closed this issue 3 years ago

dmaahs2017 commented 3 years ago

Consider using the unicode-segmentation crate's split_word_bounds function to break text into Unicode words in your load_word_list function.
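
As a rough illustration of the idea, here is a minimal standard-library sketch of Unicode-aware word splitting. It is not bayespam's actual code, and it does not implement the full word-boundary rules that the unicode-segmentation crate follows. The helper name split_unicode_words is hypothetical. It only relies on char::is_alphanumeric, which is Unicode-aware, so non-ASCII letters are kept rather than dropped.

```rust
/// Hypothetical helper (not part of bayespam): split text into lowercase
/// words, treating any non-alphanumeric character as a separator.
/// `char::is_alphanumeric` follows Unicode character properties, so
/// Cyrillic and other non-ASCII letters are kept as part of words.
fn split_unicode_words(text: &str) -> Vec<String> {
    text.split(|c: char| !c.is_alphanumeric())
        // Consecutive separators produce empty pieces; drop them.
        .filter(|piece| !piece.is_empty())
        // Normalize case so "СПАМ" and "спам" count as the same word.
        .map(|piece| piece.to_lowercase())
        .collect()
}
```

Calling split_unicode_words("Привет, мир! Free СПАМ") would yield the words привет, мир, free and спам. A real tokenizer would likely want the crate's boundary rules for cases such as apostrophes and combining marks, which this sketch treats as separators.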

zenoxygen commented 3 years ago

Hi @dmaahs2017, thanks for your contribution. Unicode support was added in the latest version (v1.1.0).

dmaahs2017 commented 3 years ago

Awesome, thanks!