Open valecarriero opened 3 years ago
@valecarriero kgtk import-wikidata
is very complex. I haven't see this error before. Can you share /Volumes/LaCie/wikidata_dump_json_29092021/latest-all.json.gz
with me on a Google Drive?
@valecarriero Import wikidata is complex. We have several releases that we can share with you so you don’t have to do it yourself. We can import the newest version and share with you.
@valecarriero
kgtk import-wikidata
is very complex. I haven't see this error before. Can you share/Volumes/LaCie/wikidata_dump_json_29092021/latest-all.json.gz
with me on a Google Drive?
I'm afraid I don't have enough space to share it! However, I downloaded it on Sept 29th, so I think it should be wikidata-20210927-all.json.gz
here: https://dumps.wikimedia.org/wikidatawiki/entities/20210927/
@valecarriero Import wikidata is complex. We have several releases that we can share with you so you don’t have to do it yourself. We can import the newest version and share with you.
It would be very useful to start working on wikidata with kgtk!
@valecarriero, do you mind giving a try a previous version of Wikidata that we have tested? last one I tried successfully is 20210104, which corresponds to this json file: https://drive.google.com/file/d/1c_yqDmM5qsKF64Ix9MSDKuAwnbRAcnjD/view?usp=sharing
If you are eager to test Wikidata out with KGTK, this is the file I produced after importing the previous file: https://drive.google.com/file/d/18VGq56BTOHU7ui_WkfcfzL-0hNAIZB0T/view?usp=sharing
We'll test it out with the newer file in the meantime.
I also got the same problem by running the kgtk import-wikidata script on macOS Monterey 12.0.1.
But, I could run the script on the Ubuntu system 20.04.3.
Hi, I'm writing here again because I need to work on the most recent version of Wikidata, so I wanted to know if this issue has been solved or not! If not (yet), would you be so kind to share with me the KGTK files for the latest version of Wikidata, as you have done with the 20210104 version? Thank you so much.
Hi @valecarriero I created this notebook and ran it on my mac laptop , https://github.com/usc-isi-i2/kgtk/blob/master/use-cases/import-wikidata.ipynb
Please give this a try. Meanwhile, a new version of Wikidata, Oct 27, 2021, is here: https://drive.google.com/drive/folders/1wsUsgPWOgOmHAqmS-eg45q9Im_-Ll5CX?usp=sharing
Describe the bug error and warning with
kgtk import-wikidata
AttributeError: Can't pickle local object 'run.<locals>.MyCollector'
UserWarning: resource_tracker: There appear to be 19 leaked shared_memory objects to clean up at shutdown warnings.warn('resource_tracker: There appear to be %d '
To Reproduce
Desktop (please complete the following information): OS: macOS Mojave 10.14.6
Additional context
-W ignore:semaphore_tracker:UserWarning
but it doesn't work (kgtk: error: unrecognized arguments: -W ignore:semaphore_tracker:UserWarning
)