snap-stanford / ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning
https://ogb.stanford.edu
MIT License
1.89k stars 397 forks source link

Bad zip file error while extracting papers100M data #480

Open SakethVNS opened 1 month ago

SakethVNS commented 1 month ago

Hi, I have tried to download ogbn-papers100M data using "from ogb.nodeproppred import NodePropPredDataset dataset = NodePropPredDataset(name = 'ogbn-papers100M')" - Using this it doesn't download entire data and throws error during extraction

I have also tried to download from the link used by ogb library "http://snap.stanford.edu/ogb/data/nodeproppred/papers100M-bin.zip" using "wget --tries=inf http://snap.stanford.edu/ogb/data/nodeproppred/papers100M-bin.zip"; After many retries the zip file is downloaded, but when I extract the file manually or give the zip file in the path of NodePropPredDataset it throws CRC error

Attaching error shown during manual extraction:

Using 7-zip: Screenshot 2024-05-09 175025

Using unzip command in linux: Screenshot 2024-05-09 175131

Is there any way around this??