uma-pi1 / kge

LibKGE - A knowledge graph embedding library for reproducible research
MIT License
765 stars 124 forks source link

Fail to preprocess Yago3-10 and DBPedia500 #251

Closed JothamWong closed 1 year ago

JothamWong commented 2 years ago

Hello,

Running the download_all.sh script successfully downloads the abovementioned datasets but runs into issue when processing them, with error "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 1086: character maps to ".

I tried to fix the issue by adding an encoding="utf-8" argument to the read file but it did not fix the problem.

Thank you for assisting.

AdrianKs commented 2 years ago

Hello, i just cloned the newest version of the repository and ran bash download_all.sh in the data directory. Everything worked as expected. Data is downloaded and preprocessed. I am working on Ubuntu 20.

Which operating system are you working on?

JothamWong commented 2 years ago

Windows 10

rgemulla commented 2 years ago

@psychicmario This is most likely due to a mismatch of encodings in your setup. Please provide the full stack trace to see where the error actually arises.

Note that we generally do not support the Windows platform. Under Windows, consider using WSL instead.