lukashermann / hulc

Hierarchical Universal Language Conditioned Policies
http://hulc.cs.uni-freiburg.de
MIT License

task_ABC_D dataset can not be unzipped #9

Closed. hk-zh closed this issue 1 year ago.

hk-zh commented 1 year ago

Hi, I am trying to train the model on the ABC dataset. I downloaded task_ABC_D.zip, but neither unzip nor 7z can extract it. The error says 'start of central directory not found; zipfile corrupt'. I am sure the zip file was fully downloaded (518 GB).

mees commented 1 year ago

Have you tried to verify the zip file with the checksum instructions?
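For readers unsure what such a verification involves, here is a minimal sketch of checking a large file against a published digest. It assumes the published checksum is a SHA-256 hex digest, which this thread does not confirm; the function names are hypothetical, and the path and digest must come from the dataset's own checksum instructions.

```python
import hashlib


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 digest of a file, read in fixed-size chunks.

    Chunked reading keeps memory use constant, which matters for an
    archive of several hundred gigabytes.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns b"" at end of file.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's digest with a published one, ignoring case and spaces."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Hypothetical usage; the digest string is whatever the maintainers publish:
# matches_expected("task_ABC_D.zip", "<published sha256 digest>")
```

A mismatch means the local bytes differ from what was published, so the file needs to be downloaded again before any extraction attempt.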

(Sent by email in reply to Hongkuan Zhou's comment of Fri, 30 Dec 2022, 13:00.)

hk-zh commented 1 year ago

> Have you tried to verify the zip file with the checksum instructions?

I haven't tried. If I use the command unzip task_ABC_D.zip, it returns the following message:

    Archive: task_ABC_D.zip
    error [task_ABC_D.zip]: start of central directory not found; zipfile corrupt.
    (please check that you have transferred or created the zip file in the appropriate BINARY mode and that you have compiled UnZip properly)

If I use the command 7z x task_ABC_D.zip, it extracts only part of the files (around 5000 files). The rest of the archive is treated as the tail:

    ERRORS:
    Headers Error
    Unconfirmed start of archive

    WARNINGS:
    There are data after the end of archive

    Physical Size = 1259053692
    Tail Size = 554050759013
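The unzip message refers to the End of Central Directory record that the ZIP format places at the end of an archive. As a quick diagnostic, the sketch below looks for that record's signature in the last bytes of a file. The function names are hypothetical. Finding the signature does not prove the archive is intact, but its absence is consistent with a truncated or altered download.

```python
import os

EOCD_SIGNATURE = b"PK\x05\x06"           # End of Central Directory record
ZIP64_LOCATOR_SIGNATURE = b"PK\x06\x07"  # ZIP64 EOCD locator, used by very large archives

# The EOCD record is 22 bytes plus an optional comment of up to 65535 bytes;
# the 20-byte ZIP64 locator, when present, sits directly before it.
MAX_TAIL_BYTES = 22 + 65535 + 20


def read_tail(path, max_bytes=MAX_TAIL_BYTES):
    """Return the last max_bytes bytes of the file (or the whole file if smaller)."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        f.seek(max(0, size - max_bytes))
        return f.read()


def has_end_of_central_directory(path):
    """True if the EOCD signature occurs where a ZIP reader would look for it."""
    return EOCD_SIGNATURE in read_tail(path)


def has_zip64_locator(path):
    """True if the tail also contains a ZIP64 locator signature."""
    return ZIP64_LOCATOR_SIGNATURE in read_tail(path)
```

If `has_end_of_central_directory` returns False for a file that is supposedly complete, re-downloading is likely a better use of time than trying further extraction tools.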

hk-zh commented 1 year ago

Actually, I also cannot extract the task_D_D.zip archive with the unzip command on Ubuntu. It complains that the file is too large. I can only use the 7z command.
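When a particular unzip build rejects an archive as too large, one possible alternative besides 7z is Python's standard zipfile module, which can read archives that use the ZIP64 extensions. The sketch below is an illustration only: `extract_archive` is a hypothetical helper, and it has not been run against the CALVIN archives discussed here.

```python
import zipfile


def extract_archive(zip_path, dest_dir, verify=False):
    """Extract every member of zip_path into dest_dir.

    Returns the number of archive members. Raises zipfile.BadZipFile if the
    file cannot be parsed as a ZIP archive, or if verify is set and a member
    fails its CRC check.
    """
    with zipfile.ZipFile(zip_path) as zf:
        if verify:
            # testzip() reads every member, so it is slow on huge archives.
            first_bad = zf.testzip()
            if first_bad is not None:
                raise zipfile.BadZipFile(f"CRC check failed for {first_bad}")
        zf.extractall(dest_dir)
        return len(zf.namelist())


# Hypothetical usage:
# extract_archive("task_D_D.zip", "dataset/")
```

A corrupted archive such as the one reported above would still fail here with `zipfile.BadZipFile`; this only sidesteps size limits in the extraction tool.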

hk-zh commented 1 year ago

Maybe I can try task_ABCD_D.zip. Are the episodes of the individual environments in the ABCD dataset kept separate in task_ABCD_D.zip, or are they mixed?

mees commented 1 year ago

I would recommend running the checksum verification first; maybe your download was interrupted. Anyway, we will update the zip files in the coming days with updated language annotations and improvements that reduce their disk size, so you might want to check the dataset again soon if you do not need it urgently.

(Sent by email in reply to Hongkuan Zhou's comment of Fri, 30 Dec 2022, 13:39.)

hk-zh commented 1 year ago

The problem might be the firewall in my organization, which may have corrupted the zip file during the download. Thank you for the information.