ISWC-Reproducibility-Track / Paper_608

0 stars 0 forks source link

Example2 - Curation and Statistics #3

Open angelosalatino opened 4 years ago

angelosalatino commented 4 years ago

What is the best way to download wikidata_edges_20200504.tsv.gz within the docker instance from GDrive?

Is it possible to have using wget?

This is the same for Example3 - Reachability

@dgarijo

dgarijo commented 4 years ago

For some reason, google drive does not work well with wget. I think the best way is to click on the link, store it in your computer and mount it as a volume in your docker image.

To do this, let's assume that you downloaded the file into a folder in your desktop called /data you would have to run KGTK mounting the volume with a command like:

docker run -it -p 8888:8888 -v "Your_path_to_data":/data uscisii2/kgtk:latest /bin/bash -c "jupyter notebook --ip='*' --port=8888 --no-browser"

Then you'll find the file on the "data" folder within the image and you can access it from the notebook. Let me know if this works, otherwise I will do an example.

dgarijo commented 4 years ago

Btw, the file is 16Gb, which is a little big (it's the full wikidata). Keep that in mind please!

angelosalatino commented 4 years ago

Perfect. This solution worked very well.