razorx89 / roco-dataset

Radiology Objects in COntext (ROCO): A Multimodal Image Dataset

Group images by archive before extraction #3

Closed saviola777 closed 5 years ago

saviola777 commented 5 years ago

Images are now grouped by archive before processing, so that each archive is downloaded only once.
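A minimal sketch of the grouping idea, not the code from this PR: the `(archive, image)` tuple layout, the file names, and the function name `group_by_archive` are all assumptions made for illustration.

```python
from collections import defaultdict


def group_by_archive(entries):
    """Group image names by the archive that contains them.

    `entries` is an iterable of (archive_name, image_name) pairs.
    This layout is hypothetical; the real dataset files may differ.
    """
    groups = defaultdict(list)
    for archive, image in entries:
        groups[archive].append(image)
    return dict(groups)


# Hypothetical input: two images share the same archive.
entries = [
    ("archive_a.tar.gz", "img1.jpg"),
    ("archive_b.tar.gz", "img2.jpg"),
    ("archive_a.tar.gz", "img3.jpg"),
]
grouped = group_by_archive(entries)

# Each archive now appears once, so it only needs to be fetched once.
for archive, images in grouped.items():
    print(archive, images)
```

Iterating over the grouped mapping instead of the flat image list is what lets a download happen once per archive rather than once per image.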

Error handling was added for several different scenarios and may seem somewhat scattered:

If downloading a package fails X times (set with the -r parameter), program execution is halted, because it is assumed that something is wrong with the FTP server (the user was locked out, temporary downtime, etc.). The other errors described above only produce a log event because they are considered permanent.
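The two error classes could be handled roughly as follows. This is a sketch under stated assumptions, not the PR's implementation: `DownloadHalted`, `download_with_retries`, `process_archives`, and the `download`/`extract` callables are hypothetical names, the retry count is assumed to apply per archive, and `OSError`/`KeyError` stand in for whatever exceptions the real code catches.

```python
import logging


class DownloadHalted(RuntimeError):
    """Raised when an archive could not be downloaded after all retries."""


def download_with_retries(download, archive, max_retries):
    """Call download(archive), retrying up to max_retries times.

    `max_retries` plays the role of the -r parameter. Exhausting it is
    treated as a server-side problem, so the error is raised to the caller.
    """
    last_error = None
    for attempt in range(1, max_retries + 1):
        try:
            return download(archive)
        except OSError as exc:  # assumed stand-in for FTP/network errors
            last_error = exc
            logging.warning("Download of %s failed (attempt %d/%d): %s",
                            archive, attempt, max_retries, exc)
    raise DownloadHalted(
        f"giving up on {archive} after {max_retries} attempts") from last_error


def process_archives(grouped, download, extract, max_retries=3):
    """Download each archive once and extract its images.

    Download failures halt the run; per-image failures are only logged,
    since retrying them is assumed to be pointless.
    """
    failed = []
    for archive, images in grouped.items():
        data = download_with_retries(download, archive, max_retries)
        for image in images:
            try:
                extract(data, image)
            except KeyError as exc:  # assumed stand-in for a permanent error
                logging.error("Skipping %s in %s: %s", image, archive, exc)
                failed.append(image)
    return failed
```

Separating the two paths keeps the distinction explicit: a `DownloadHalted` exception stops the whole run, while a permanent per-image error is recorded and processing continues with the next image.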