ageitgey / node-unfluff

Automatically extract body content (and other cool stuff) from an html document
Apache License 2.0
2.15k stars 223 forks source link

Add Bangla & Vietnamese stopwords #107

Open muiton opened 4 years ago

muiton commented 4 years ago

Hi,

I am from Bangladesh and wanted scrap websites in Bangla. This library didn't had any Bangla stopwords, so I added them and test on few Bangla sites.

The Vietnamese stopwords came from issue #101 and I send the file to Vietnamese friend and said it's working fine....

Please merge this so that others can use these two languages.