tii-racing / drone-racing-dataset

A fully-annotated, open-design dataset of autonomous and piloted high-speed flight
https://ieeexplore.ieee.org/document/10452776
MIT License
48 stars 9 forks source link

piloted zipfile is corrupted #5

Closed ChanJoon closed 1 month ago

ChanJoon commented 1 month ago

Thank you for your work.

As I followed the instructions in Release or ran data_downloader.sh, the process fails to unzip piloted.zip

To reproduce,

wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk01
wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk02
wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk03
wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk04
wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk05
wget https://github.com/Drone-Racing/drone-racing-dataset/releases/download/v2.0.0/piloted_zipchunk06
cat piloted_zipchunk* > piloted.zip
unzip piloted.zip

Then the results shows

Archive:  piloted.zip
fatal error: read failure while seeking for End-of-centdir-64 signature.
  This zipfile is corrupt.
unzip:  cannot find zipfile directory in one of piloted.zip or
        piloted.zip.zip, and cannot find piloted.zip.ZIP, period

I tried to repair the zipfile as follows:

❯ zip -FF piloted.zip --out fixed.zip
Fix archive (-FF) - salvage what can
 Found end record (EOCDR) - says expect single disk archive
Scanning for entries...
 copying: piloted/  (0 bytes)
 copying: piloted/flight-01p-ellipse/  (0 bytes)
 copying: piloted/flight-01p-ellipse/camera_flight-01p-ellipse.zip 
    zip warning: no end of stream entry found: piloted/flight-01p-ellipse/camera_flight-01p-ellipse.zip
    zip warning: rewinding and scanning for later entries

How to fix it? Or could you please look into the released zipchunks and update them?

Thanks!

MichaelBosello commented 1 month ago

Thank you for the interest in our dataset.

We cannot reproduce the problem at the moment. Which OS are you using? Did you try downloading the zip chunks again?

ChanJoon commented 1 month ago

Thank you for the interest in our dataset.

We cannot reproduce the problem at the moment. Which OS are you using? Did you try downloading the zip chunks again?

Thank you for your quick response.

I’m currently using Ubuntu 20.04. I will manually download the zip chunks from the GitHub website and let you know the results.

ChanJoon commented 1 month ago

Sorry for late response.

I brought more detailed inspection with manually downloaded zip chunks. (from the Release)

image

As you can see in the image, piloted_zipchunk01 shows a Zip archive type while others are Binary types.

Tried to open the piloted_zipchunk01 also failed. I think it's corrupted.

So could you please check those files again?

Or you can send me your file via my email (chanjoon.park@kaist.ac.kr). Then I will check on my Ubuntu 20.04.

Thanks!

ChanJoon commented 1 month ago

I solved. There was a no problem in your release.

cat piloted_zipchunk01 piloted_zipchunk02 piloted_zipchunk03 piloted_zipchunk04 piloted_zipchunk05 piloted_zipchunk06 > combined.zip # For specify the first header
cat piloted_zipchunk0* > piloted.zip
unzip piloted.zip # or combined.zip

Maybe the problem was due to my filesystem or wget. Manually downloading the files worked. (though I’m not sure why my previous shell script attempts failed.)