razorx89 / roco-dataset

Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
165 stars 17 forks source link

Support downloading moved archives #2

Closed saviola777 closed 5 years ago

saviola777 commented 5 years ago

It seems that archives are occasionally moved within the folder structure, so some of the links in the dlinks.txt will point to the wrong location for the archive. As a simple workaround, I'm using an API which returns the current location of the archive based on the PMCID to determine the correct location if a download fails.

In the future we could add a script to fix the links in the dlinks.txt files directly if the number of dead links gets too high (I'm not sure how frequently archives are moved).