aboSamoor / polyglot

Multilingual text (NLP) processing toolkit
http://polyglot-nlp.com

urllib.error.HTTPError: HTTP Error 403: Forbidden #241

Closed apongos closed 3 years ago

apongos commented 3 years ago

After running

pip install polyglot

I try to run

from polyglot.text import Text, Word

x = 'word'
Word(x, language='en').morphemes

and I get this error: urllib.error.HTTPError: HTTP Error 403: Forbidden

TL;DR: pip install failed to fetch polyglot 16.7.4 and installed polyglot-15.10.3 from cache instead. This cached version of polyglot pointed to this URL: https://whoisbigger.com/polyglot/index.json

The URL should have been http://polyglot.cs.stonybrook.edu/~polyglot/

Downloading the tar and running python setup.py install resolved my issue :)
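
For anyone wanting to confirm whether they hit the same cached install, here is a minimal sketch that compares the installed polyglot version against 16.7.4. The names `EXPECTED`, `parse_version`, `installed_version`, and `is_stale` are made up for illustration, and the comparison assumes plain dotted numeric versions such as the two mentioned above.

```python
from importlib import metadata

# Version the reporter expected pip to install (taken from this thread).
EXPECTED = (16, 7, 4)


def parse_version(text):
    """Turn a plain dotted version such as '15.10.3' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def installed_version(package="polyglot"):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def is_stale(installed, expected=EXPECTED):
    """True when the installed version is older than the expected one."""
    return parse_version(installed) < expected


found = installed_version()
if found is None:
    print("polyglot is not installed")
elif is_stale(found):
    print(f"polyglot {found} is older than expected")
else:
    print(f"polyglot {found} looks current")
```

If the check reports an older version, reinstalling with pip's `--no-cache-dir` option should keep pip from reusing the cached package.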

apongos commented 3 years ago

Perhaps related: I also tried downloading the English embeddings

from polyglot.downloader import downloader
downloader.download("embeddings2.en")

and got this error:

[polyglot_data] Error loading embeddings2.en: HTTP Error 403:
[polyglot_data]     Forbidden

apongos commented 3 years ago

I'm looking at one of my working installations of polyglot, and its download URL is instead

http://polyglot.cs.stonybrook.edu/~polyglot/

I think someone replaced the URL with a bad one?
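
One way to check which of the two URLs actually answers is to probe both and compare the HTTP status codes. This is only a sketch: `CANDIDATE_URLS`, `probe`, and `report` are illustrative names, the `opener` parameter exists so the function can be tried without network access, and the responses you see today may differ from what was observed in this thread.

```python
import urllib.error
import urllib.request

# Both URLs are copied from this thread.
CANDIDATE_URLS = [
    "https://whoisbigger.com/polyglot/index.json",   # reported in the cached 15.10.3 install
    "http://polyglot.cs.stonybrook.edu/~polyglot/",  # reported in the working install
]


def probe(url, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for url, or None if no HTTP response arrives."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 403.
        return err.code
    except OSError:
        # DNS failure, refused connection, timeout, and similar.
        return None


def report(urls=CANDIDATE_URLS, opener=urllib.request.urlopen):
    """Print one line per URL with the status code that probe() observed."""
    for url in urls:
        print(url, "->", probe(url, opener=opener))
```

Calling `report()` performs real network requests, so it is left out of the block itself.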

MarcelinoV commented 3 years ago

@apongos Where does the http://polyglot.cs.stonybrook.edu/~polyglot/ link go? In the downloader file?

rp2839 commented 3 years ago

It can be changed in the downloader.py file, around line 364, at least in the version I installed, which was 15.10.3.
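
Since the line number may differ between versions, a small search can locate the line to edit. This is a hedged sketch: `find_url_lines` and `polyglot_downloader_path` are hypothetical helper names, the file name `downloader.py` is assumed from the `polyglot.downloader` import used earlier in the thread, and the search string is just the host of the URL reported above.

```python
import importlib.util
from pathlib import Path

# Host of the URL reported in this thread; adjust if your copy differs.
OLD_URL_HOST = "whoisbigger.com"


def find_url_lines(path, needle=OLD_URL_HOST):
    """Return (line_number, text) pairs for every line of path containing needle."""
    hits = []
    with open(path, encoding="utf-8") as handle:
        for number, line in enumerate(handle, start=1):
            if needle in line:
                hits.append((number, line.rstrip("\n")))
    return hits


def polyglot_downloader_path():
    """Guess the path of the installed downloader module, or None if not found.

    Assumes the module lives at <package dir>/downloader.py.
    """
    spec = importlib.util.find_spec("polyglot")
    if spec is None or not spec.submodule_search_locations:
        return None
    candidate = Path(list(spec.submodule_search_locations)[0]) / "downloader.py"
    return candidate if candidate.exists() else None
```

With polyglot installed, `find_url_lines(polyglot_downloader_path())` should list the lines that mention the old host, which is where the URL would be swapped for http://polyglot.cs.stonybrook.edu/~polyglot/.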