csurfer / rake-nltk

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
https://csurfer.github.io/rake-nltk
MIT License

results are duplicating keywords with score 1 instead of #66

Open jmorenoven2 opened 2 years ago

jmorenoven2 commented 2 years ago


The list of results contains duplicated keywords with score 1. This started after I upgraded Anaconda, which forced me to reinstall rake-nltk.

Text to reproduce the error, which returns the keyword "solar" twice with score = 1: "spectroscopy of the globular cluster dip source x 1746 371 ngc 6441. we propose a 50 xmm observation of the dipping xray source x 1747 371 located in the globular cluster ngc 6441. this source exhibits highly energy independent dips, consistent with an abundance >150 times less than solar, which repeat every 5.7 hours, whereas the overall cluster abundance is only a factor 4 to 10 below solar. resolving this discrepancy is the prime goal of this proposal. this study requires the high throughput, good spectral resolution, and continuous coverage afforded by xmm"

esakru commented 1 year ago

I have the same issue, not only for score = 1 but for other scores as well, using RAKE for German.
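
A possible workaround until the cause is confirmed: filter the result list yourself. The sketch below assumes the results are the list of `(score, phrase)` tuples that `get_ranked_phrases_with_scores()` returns; `dedupe_ranked_phrases` is a hypothetical helper, not part of rake-nltk, and the sample values are illustrative rather than real output. Recent rake-nltk releases also appear to accept an `include_repeated_phrases` argument on `Rake`, but check that against your installed version before relying on it.

```python
from typing import Iterable, List, Tuple

ScoredPhrase = Tuple[float, str]


def dedupe_ranked_phrases(ranked: Iterable[ScoredPhrase]) -> List[ScoredPhrase]:
    """Keep the first occurrence of each phrase, preserving the ranking order.

    Hypothetical helper: it only post-processes a list of (score, phrase)
    tuples and makes no call into rake-nltk itself.
    """
    seen = set()
    unique = []
    for score, phrase in ranked:
        if phrase in seen:
            # Same phrase already emitted earlier in the ranking: skip it.
            continue
        seen.add(phrase)
        unique.append((score, phrase))
    return unique


# Illustrative input only, shaped like the duplicated result from the report.
sample = [(4.0, "example phrase"), (1.0, "solar"), (1.0, "solar")]
print(dedupe_ranked_phrases(sample))
```

This keeps one entry per phrase regardless of score, so it should also cover duplicates with scores other than 1.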