NYU-MLDA / OpenABC

OpenABC-D is a large-scale labeled dataset generated by synthesizing open source hardware IPs. This dataset can be used for various graph level prediction problems in chip design.
BSD 3-Clause "New" or "Revised" License
107 stars 19 forks source link

Merging and Unziping 13chunks of Dataset #4

Open raykakarimi opened 1 year ago

raykakarimi commented 1 year ago

Hi, I downloaded the big dataset. When I merge (cat) the files and I want to unzip the final part I get the following error. "bad zipfile offset (local header sig):" I appreciate it if you provide the instruction for reconstructing final dataset. (merging and unzipping it).

animeshbchowdhury commented 1 year ago

Hi @raykakarimi ,

I will try to reproduce it on my side. And fix this over the weekend if the issue persists on my side. Thanks.

raykakarimi commented 1 year ago

thanks @animeshbchowdhury, I get the following error when I try unzip the merged file. extracting: ~/openabce/OPENABC-D/graphml/dft.zip bad CRC 439e1ec4 (should be 05839b9f) error: invalid zip file with overlapped components (possible zip bomb)

animeshbchowdhury commented 1 year ago

Hi @raykakarimi,

Looks like the site hosting the data have moved it to a different server possibly corrupting a chunk of data. I am replacing it with a backup which is functional. Please give it an attempt in next 48 hours.

liangrj2014 commented 9 months ago

Hi, I also encountered "bad zipfile offset (local header sig):" after merging and then unzipping the 14 chunks of dataset. Could you please help me with it? Thanks!

animeshbchowdhury commented 1 month ago

Hi @liangrj2014,

Will you be able to provide a log file? On my end, I'm unable to reproduce it, so it will help me if it is related to certain version mismatch of zip and/or some issue.

Thanks again.