csurfer / rake-nltk

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
https://csurfer.github.io/rake-nltk
MIT License

Support for Chinese #15

Closed cosmoschen94 closed 6 years ago

cosmoschen94 commented 6 years ago

Does rake-nltk fully support Chinese in the newest version?

csurfer commented 6 years ago

@cosmoschen94: I don't believe NLTK currently supports Chinese stopwords or tokenization. Since this package depends on NLTK for both tokenization and stopwords, it doesn't support Chinese at the moment.

If I am wrong and NLTK does support Chinese tokenization, please send me the link and the relevant details, and I will add instructions to the README on how to make it work.
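To illustrate why tokenization and stopwords are the two pieces that matter here, below is a minimal, self-contained sketch of RAKE-style scoring with both supplied by the caller. This is not rake-nltk's code or API: `rake_keywords`, the tiny stopword set, and the sample sentence are hypothetical, and the sketch assumes the Chinese text has already been segmented into space-separated words by a word segmenter of your choice.

```python
from collections import defaultdict


def rake_keywords(text, tokenize, stopwords, punctuation):
    """Rank candidate phrases RAKE-style with a caller-supplied tokenizer.

    Hypothetical helper for illustration only; not part of rake-nltk.
    """
    # Split the token stream into candidate phrases at stopwords/punctuation.
    phrases, current = [], []
    for token in tokenize(text):
        if token in stopwords or token in punctuation:
            if current:
                phrases.append(tuple(current))
            current = []
        else:
            current.append(token)
    if current:
        phrases.append(tuple(current))

    # Word score = degree / frequency, where degree sums the lengths of
    # the phrases a word appears in (so it counts the word itself).
    freq, degree = defaultdict(int), defaultdict(int)
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)

    # Phrase score = sum of its word scores; highest score first.
    scores = {p: sum(degree[w] / freq[w] for w in p) for p in set(phrases)}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Assumption: the text is already segmented, so str.split works as tokenizer.
segmented = "我 喜欢 自然 语言 处理 和 机器 学习 。"
stopwords = {"我", "喜欢", "和", "的"}   # illustrative, far from complete
punctuation = {"。", "，"}

ranked = rake_keywords(segmented, str.split, stopwords, punctuation)
for phrase, score in ranked:
    print(" ".join(phrase), score)
```

With a real segmenter and a fuller Chinese stopword list swapped in for `str.split` and the toy set above, the same scoring logic should carry over unchanged, which is why the missing NLTK pieces are the only blocker.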

fighting41love commented 6 years ago

I have modified the code so that it supports Chinese now. I will upload it to a repo soon.

Jiannan28 commented 5 years ago

Do we have Chinese support now, or is it in another package? Thanks.