ZhangYuanhan-AI / CelebA-Spoof

[ECCV2020] A Large-Scale Face Anti-Spoofing Dataset

Here is a script for downloading the entire dataset using gdown in python #35

Closed by thisisbhavin 3 years ago

thisisbhavin commented 3 years ago

I have written a small Python script that downloads all the files (see the linked gist).

thisisbhavin commented 3 years ago

If you are on an EC2 instance (Ubuntu machine), you can concatenate the downloaded files and then unzip the result.
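One way to do the concatenate-and-unzip step from Python is sketched below. It assumes the parts are plain byte-wise splits of a single zip and are named like `CelebA_Spoof.zip.001`, `CelebA_Spoof.zip.002`, and so on (the naming used elsewhere in this thread). `join_parts` and `extract` are hypothetical helper names, not part of the original script.

```python
import shutil
import zipfile
from pathlib import Path


def join_parts(first_part: Path, joined: Path) -> list:
    """Concatenate first_part and its numbered siblings (.001, .002, ...) into joined."""
    stem = first_part.name.rsplit(".", 1)[0]  # e.g. "CelebA_Spoof.zip"
    # Zero-padded numeric suffixes sort correctly as plain strings.
    parts = sorted(
        p for p in first_part.parent.glob(stem + ".*") if p.suffix[1:].isdigit()
    )
    with open(joined, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                # Stream the bytes so a large part never has to fit in memory.
                shutil.copyfileobj(src, out)
    return parts


def extract(joined: Path, dest: Path) -> int:
    """Extract the joined archive into dest and return the number of archive members."""
    with zipfile.ZipFile(joined) as zf:
        zf.extractall(dest)
        return len(zf.namelist())


# Example (paths are assumptions):
# join_parts(Path("CelebA_Spoof.zip.001"), Path("CelebA_Spoof.zip"))
# extract(Path("CelebA_Spoof.zip"), Path("CelebA_Spoof"))
```

The joined archive needs roughly as much free disk space again as the parts themselves, so check the available space before running it.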

atotest-git commented 3 years ago

Hello,

I have a question: how do I extract them on Windows?

thisisbhavin commented 3 years ago

for windows -

That's it; it will unzip everything into one folder.

atotest-git commented 3 years ago

Thanks for the tip, but I cannot download all of the files. If you have an already-extracted version of the dataset, please let me know.

thisisbhavin commented 3 years ago

Unfortunately, no!

AIshutin commented 3 years ago

Hi! Have you run into missing files in the dataset after downloading all the zip parts, concatenating them, and unzipping? For example, Data/train/940/spoof/167212.jpg is missing for me. If so, how did you solve the problem?

thisisbhavin commented 3 years ago

I did not look into this at the time, but now that you have pointed it out, I do see some missing files and folders. I'm not sure whether that is caused by the unzipping process or whether they were never there to begin with. I need to look into it.
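To tell those two cases apart, one option is to compare the archive's own member list with what actually landed on disk. The sketch below is only an illustration, assuming the joined archive is still available and readable by Python's `zipfile`; `missing_after_extract` is a hypothetical helper name.

```python
import zipfile
from pathlib import Path


def missing_after_extract(archive: Path, extracted_root: Path) -> list:
    """Return archive members (files only) that do not exist under extracted_root."""
    with zipfile.ZipFile(archive) as zf:
        # Skip directory entries; only regular files are checked.
        members = [info.filename for info in zf.infolist() if not info.is_dir()]
    return [name for name in members if not (extracted_root / name).is_file()]


# Example (paths are assumptions):
# lost = missing_after_extract(Path("CelebA_Spoof.zip"), Path("."))
# print(len(lost), "files are listed in the archive but missing on disk")
```

A file that shows up in this list was lost during extraction. A file that is absent from disk but not in this list was never in the archive that `zipfile` could read, which points at the download or concatenation step instead.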

AIshutin commented 3 years ago

As described in #9

7z x -aoa CelebA_Spoof.zip.001

works perfectly for me. Thus, I think it's better to change the script.
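If the download script were changed to call 7z after downloading, a minimal sketch could look like the following. It assumes the `7z` executable is on the PATH; the `-o` output-directory switch and the helper names `build_7z_command` and `extract_with_7z` are additions for illustration and are not part of the command quoted above.

```python
import shutil
import subprocess
from pathlib import Path


def build_7z_command(first_part: Path, dest: Path) -> list:
    """Build the 7z command line for extracting a split archive."""
    # x    = extract with full paths
    # -aoa = overwrite existing files without prompting
    # -o   = output directory (written with no space before the path)
    return ["7z", "x", "-aoa", f"-o{dest}", str(first_part)]


def extract_with_7z(first_part: Path, dest: Path) -> None:
    """Run 7z on the first part; 7z picks up the remaining parts itself."""
    if shutil.which("7z") is None:
        raise RuntimeError("7z was not found on PATH")
    # check=True raises CalledProcessError if 7z exits with a non-zero status.
    subprocess.run(build_7z_command(first_part, dest), check=True)


# Example (paths are assumptions):
# extract_with_7z(Path("CelebA_Spoof.zip.001"), Path("."))
```

Pointing 7z at the `.001` file avoids the separate concatenation step, so no extra copy of the archive is written to disk.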

brookekelseyryan commented 3 years ago

when running the script I get this error:

Too many users have viewed or downloaded this file recently. Please
try accessing the file again later. If the file you are trying to
access is particularly large or is shared with many people, it may
take up to 24 hours to be able to view or download the file. If you
still can't access a file after 24 hours, contact your domain
administrator.

beyza-yildirim commented 3 years ago

This happens because Google Drive limits the number of downloads for a given file, and gdown is subject to that limit. It is not related to the script. I would try again in 24 hours.
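Since only some parts may hit the quota on a given day, it can help to make the download loop resumable: skip parts already on disk and collect the ones that failed so they can be retried later. The sketch below is an assumption-laden illustration, not the script from the gist. The part names and file IDs passed in are placeholders you would supply yourself, `download_pending` and `gdown_fetch` are hypothetical names, and it assumes `gdown.download` either returns `None` or raises when a download fails, depending on the gdown version.

```python
from pathlib import Path
from typing import Callable, Dict, List


def download_pending(files: Dict[str, str], out_dir: Path,
                     fetch: Callable[[str, str], object]) -> List[str]:
    """Try every part that is not on disk yet; return the names that still failed."""
    out_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for name, file_id in files.items():
        target = out_dir / name
        if target.exists():
            continue  # already downloaded in an earlier run
        try:
            ok = fetch(file_id, str(target))
        except Exception as exc:  # e.g. a quota error raised by the downloader
            print(f"{name}: {exc}")
            ok = None
        if not ok:
            failed.append(name)
    return failed


def gdown_fetch(file_id: str, output: str):
    """Download one Google Drive file by ID using gdown."""
    import gdown  # third-party: pip install gdown
    return gdown.download(id=file_id, output=output, quiet=False)


# Example (the file ID is a placeholder, not a real ID):
# failed = download_pending({"CelebA_Spoof.zip.001": "<file-id>"}, Path("parts"), gdown_fetch)
# print("retry later:", failed)
```

Rerunning the same call the next day only attempts the parts that are still missing.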

abhirajasp commented 2 years ago

If you are having trouble downloading the dataset, you can try #55.