athalhammer / danker

Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.
GNU General Public License v3.0
56 stars 4 forks source link

Separate download from link creation (was: Error launching danker.sh) #11

Closed AlessandroCalio closed 4 years ago

AlessandroCalio commented 4 years ago

Hi, I'm interested in your project but I have this error.

I'm using Debian 10, I have created a venv and installed the packages necessary for running your project. What i'm doing wrong? Thanks in advance

image

athalhammer commented 4 years ago

Dear Alessandro,

It looks like you need to install the curl command (sudo apt-get install curl). I was assuming that it is available on most Linux systems.

Cheers Andreas

AlessandroCalio commented 4 years ago

Thanks a lot, I'm not a very experienced linux user. I noticed that I was having dependency issues with the curl package and reinstalled it. Now is seems working :) .

Sorry for the off topic but i also wanted to tell you that it would be very convenient to launch danker on user-chosen dumps. Is it already possible?

athalhammer commented 4 years ago

You mean specifying a date in addition to the language edition (e.g., 2019-06-01)?

AlessandroCalio commented 4 years ago

Exactly, or give the possibility to input an already downloaded dump.

athalhammer commented 4 years ago

This has been implemented now and a folder as well as an alternative dumpdate can be specified. E.g. for an old english dump:

./danker.sh en --project wiki --dump-date 20120101 --folder myfolder