Open DavidBruant opened 10 years ago
Massive thanks to @gmarty.
Take a look at https://github.com/NaturalNode/natural There are tokenizers (to split a sentence into words) for multiple languages.
To detect word groups (like "social network"), look at n-grams. Also look at inflectors (singular/plural variations).
For common roots, use stemmer like https://github.com/fortnightlabs/snowball-js
Maybe display like http://static4.businessinsider.com/image/4fb3df97ecad04ef32000003-610-/android-fragmentation-chart.png
Depends on #68
Massive thanks to @gmarty.
Take a look at https://github.com/NaturalNode/natural There are tokenizers (to split a sentence into words) for multiple languages.
To detect word groups (like "social network"), look at n-grams. Also look at inflectors (singular/plural variations).
For common roots, use stemmer like https://github.com/fortnightlabs/snowball-js