google-research / pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Apache License 2.0

checksum error for downloaded data #7

Closed by dipendra009 2 years ago

dipendra009 commented 2 years ago

Hi, I am getting a checksum error when the code automatically downloads data from the Google bucket. Here is the error message:

  File "/usr/local/lib/python3.8/dist-packages/tensorflow_datasets/core/download/download_manager.py", line 343, in _handle_download_result
    raise NonMatchingChecksumError(resource.url, tmp_path)
tensorflow_datasets.core.download.download_manager.NonMatchingChecksumError: Artifact http://images.cocodataset.org/zips/test2017.zip, downloaded to /root/tensorflow_datasets/downloads/images.cocodataset.org_zips_test2017KDQv8bPgQock_hnrTxvqZbYabAXdY4EN91CL7w8_GTo.zip.tmp.86dbce4c6d70492c9e71a03288cf434e/test2017.zip, has wrong checksum. This might indicate:

Could you please update the checksum so that we can use the code?

Regards, Dipendra

chentingpc commented 2 years ago

Thanks for reporting the error. This issue is specific to the dataset download module of tensorflow_datasets, so I'm not sure how much I can help. As the error message suggests, you could download the COCO dataset manually to bypass the automatic download in tensorflow_datasets that is causing the problem. Before that, you could try something even simpler: delete the existing COCO dataset files that tensorflow_datasets has already downloaded, then rerun so they are fetched again.
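
A minimal sketch of the simpler suggestion, using only the Python standard library. It assumes the cached files live in the downloads directory shown in the traceback (/root/tensorflow_datasets/downloads) and that stale COCO entries can be identified by a fragment of their file name. The helper names `remove_cached_downloads` and `sha256_of_file` are hypothetical and are not part of tensorflow_datasets or pix2seq. The second helper only prints a digest for a manually downloaded archive so it can be compared by hand; which digest tensorflow_datasets expects is not stated in this thread.

```python
import hashlib
import shutil
from pathlib import Path


def remove_cached_downloads(downloads_dir, name_fragment):
    """Delete entries directly under downloads_dir whose name contains name_fragment.

    Returns the sorted list of removed entry names. Hypothetical helper,
    not a tensorflow_datasets API.
    """
    removed = []
    root = Path(downloads_dir)
    if not root.is_dir():
        # Nothing cached at this location, so there is nothing to clean up.
        return removed
    for entry in sorted(root.iterdir()):
        if name_fragment in entry.name:
            if entry.is_dir():
                # Partial downloads are kept in "*.tmp.<id>" directories.
                shutil.rmtree(entry)
            else:
                entry.unlink()
            removed.append(entry.name)
    return removed


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For the case in this issue, a call such as `remove_cached_downloads("/root/tensorflow_datasets/downloads", "images.cocodataset.org")` would clear the partially downloaded test2017.zip before rerunning the code. Check the directory contents first, since the deletion cannot be undone.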