MrtnMndt / meta-learning-CODEBRIM

Open-source code for our CVPR19 paper "Meta-learning Convolutional Neural Architectures for Multi-target Concrete Defect Classification with the COncrete DEfect BRidge IMage Dataset".

Extracting dataset on Ubuntu 16.04 LTS #3

Closed mgpadalkar closed 2 years ago

mgpadalkar commented 2 years ago

Hi all,

I had trouble extracting the dataset available at https://zenodo.org/record/2620293, and @MrtnMndt helped me resolve it. I describe the problem here, followed by the solution that worked.

The md5 checksums of the downloaded zip files are correct, but extraction produced the following errors:

...
Extracting classification_dataset_balanced/val/defects/image_0001304_crop_0000004.png
Extracting classification_dataset_balanced/val/defects/image_0001129_crop_0000002.png
Extracting classification_dataset_balanced/val/defects/image_0001126_crop_0000005.png
Extracting __MACOSX/classification_dataset_balanced/val/defects/._image_0001126_crop_0000005.png
Extracting classification_dataset_balanced/val/defects/image_0000300_crop_0000001.png

Sub items Errors: 7575
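A matching checksum rules out a corrupted download. For readers who want to repeat that check, here is a minimal sketch using `md5sum` from GNU coreutils. The function name `verify_md5` and the variable `EXPECTED_MD5` are made up for illustration; the real checksum has to be copied from the Zenodo record.

```bash
#!/usr/bin/env bash
# Compare a file's md5 digest with an expected value.
# Returns 0 on a match and 1 on a mismatch.
verify_md5() {
    local file="$1" expected="$2" actual
    # md5sum prints "<digest>  <filename>"; keep only the digest.
    actual=$(md5sum "$file" | awk '{print $1}')
    if [ "$actual" = "$expected" ]; then
        echo "OK: $file"
    else
        echo "MISMATCH: $file (got $actual)" >&2
        return 1
    fi
}

# Usage (EXPECTED_MD5 is a placeholder, not the real checksum):
# verify_md5 CODEBRIM_classification_balanced_dataset.zip "$EXPECTED_MD5"
```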


A closer look at the output showed the following for some files:
```bash
...
Extracting  classification_dataset_balanced/train/background/image_0000531_crop_0000005.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000324_crop_0000003.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0001311_crop_0000001.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000334_crop_0000005.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000030_crop_0000005.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000233_crop_0000005.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000020_crop_0000003.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000429_crop_0000004.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000969_crop_0000001.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000439_crop_0000002.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000001_crop_0000002.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000011_crop_0000004.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000202_crop_0000002.png     Unsupported Method
Extracting  classification_dataset_balanced/train/background/image_0000399_crop_0000003.png     Unsupported Method
...
```
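To see how many entries failed this way, the extractor output can be saved to a file and filtered. The sketch below assumes the output was redirected to a log file (the name `extract.log` is hypothetical) and that paths contain no spaces, as in the listing above. The function names are also made up for illustration.

```bash
#!/usr/bin/env bash
# Count the log lines flagged with "Unsupported Method".
count_unsupported() {
    grep -c 'Unsupported Method' "$1"
}

# Print only the paths of the flagged entries (second whitespace-separated field).
list_unsupported() {
    awk '/Unsupported Method/ {print $2}' "$1"
}

# Usage (extract.log is a hypothetical file holding the extractor output):
# count_unsupported extract.log
# list_unsupported extract.log | head
```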

@MrtnMndt pointed out that the operating system probably does not handle large zip files (beyond 4 GB) automatically. He tried it on his Mac and on Windows, where the extraction worked fine, and suggested that I use a different extraction tool, following https://askubuntu.com/questions/959256/cant-extract-a-large-zip-file.
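Whether an archive falls into this "large zip" category can be checked quickly. The sketch below compares the file size against 2^32 - 1 bytes, the largest value the 32-bit size fields of the classic zip format can hold. The function name `exceeds_limit` is made up for illustration, and the check only tests the size; it does not confirm that the size is the actual cause of the errors.

```bash
#!/usr/bin/env bash
# Largest size a classic (non-ZIP64) zip size field can hold: 2^32 - 1 bytes.
ZIP32_LIMIT=4294967295

# Succeeds (exit status 0) when the file is larger than the limit.
# The optional second argument overrides the default limit.
exceeds_limit() {
    local file="$1" limit="${2:-$ZIP32_LIMIT}" size
    # The arithmetic expansion strips any padding that wc may print.
    size=$(( $(wc -c < "$file") ))
    [ "$size" -gt "$limit" ]
}

# Usage:
# exceeds_limit CODEBRIM_classification_balanced_dataset.zip && echo "larger than 4 GB"
```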

The solution that worked

sudo apt-get install fastjar
jar xvf CODEBRIM_classification_balanced_dataset.zip

:heavy_check_mark: This is also the accepted solution on: https://unix.stackexchange.com/questions/438368/unix-unzip-is-failing-but-mac-archive-utility-works
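After extracting, a quick sanity check is to count the PNG files that actually landed on disk, leaving out the `__MACOSX` metadata entries that appear in the extraction output. This is only a sketch: the function name `count_pngs` is made up, and the thread does not state how many images to expect, so the count is only useful for comparison against another successful extraction.

```bash
#!/usr/bin/env bash
# Count extracted PNG files below a directory, skipping macOS "__MACOSX" metadata.
count_pngs() {
    # The arithmetic expansion strips any padding that wc may print.
    echo $(( $(find "$1" -type f -name '*.png' ! -path '*/__MACOSX/*' | wc -l) ))
}

# Usage (run from the directory where the archive was extracted):
# count_pngs .
```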

:x: We also tried the following, which did not work:

sudo apt-get install dtrx
dtrx CODEBRIM_classification_balanced_dataset.zip

MrtnMndt commented 2 years ago

Thanks for going through the effort to make this discussion public so others can benefit from it. Much appreciated!