aboSamoor / polyglot

Multilingual text (NLP) processing toolkit
http://polyglot-nlp.com
Other
2.32k stars 338 forks source link

UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 4941: character maps to <undefined> #254

Open chelove4draste opened 2 years ago

chelove4draste commented 2 years ago

windows 10, python 3.6.8 pip install polyglot==16.7.4


  ERROR: Command errored out with exit status 1:
   command: 'G:\aihub\mainvenv\Scripts\python.exe' -c 'import io, os, sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\richard\\AppData\\Local\\Temp\\pip-install-x1qy942k\\polyglot_7f1d84d4b0084c59a896800cb1eb2e5a\\setup.py'"'"'; __file__='"'"'C:\\Users\\richard\\AppData\\Local\\Temp\\pip-install-x1qy942k\\polyglot_7f1d84d4b0084c59a896800cb1eb2e5a\\setup.py'"'"';f = getattr(tokenize, '"'"'open'"'"', open)(__file__) if os.path.exists(__file__) else io.StringIO('"'"'from setuptools import setup; setup()'"'"');code = f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base 'C:\Users\richard\AppData\Local\Temp\pip-pip-egg-info-jarizxed'
       cwd: C:\Users\richard\AppData\Local\Temp\pip-install-x1qy942k\polyglot_7f1d84d4b0084c59a896800cb1eb2e5a\
  Complete output (7 lines):
  Traceback (most recent call last):
    File "<string>", line 1, in <module>
    File "C:\Users\richard\AppData\Local\Temp\pip-install-x1qy942k\polyglot_7f1d84d4b0084c59a896800cb1eb2e5a\setup.py", line 15, in <module>
      readme = readme_file.read()
    File "d:\___windowspc\py3.6\lib\encodings\cp1252.py", line 23, in decode
      return codecs.charmap_decode(input,self.errors,decoding_table)[0]
  UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 4941: character maps to <undefined>