tensorflow / datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
https://www.tensorflow.org/datasets
Apache License 2.0

Checksum not matching when building the300w_lp dataset #5535

Closed albertxcastro closed 2 months ago

albertxcastro commented 2 months ago


Short description

I'm trying to load the the300w_lp dataset using:

import tensorflow as tf
import tensorflow_datasets as tfds

dataset, info = tfds.load('the300w_lp', with_info=True, split='train')

but I get the following error:

NonMatchingChecksumError: Artifact https://drive.google.com/uc?export=download&id=0B7OEHD3T4eCkVGs0TkhUWFN6N1k, downloaded to /Users/albertxcastro/tensorflow_datasets/downloads/ucexport_download_id_0B7OEHD3T4eCkVGs0TkhUWFN6KF3egIz1jDXUd4a-SiFYeMLsQp-9ySO-6HOMfa9q4Bk.tmp.57ebca469dff4c418ef38e8eebe719ff/download, has wrong checksum:
* Expected: UrlInfo(size=2.63 GiB, checksum='250c366c417ad48f72522b629ff83dbb2b2a57945803ad4a530a696bb6b13ee7', filename='300W-LP.zip')
* Got: UrlInfo(size=2.36 KiB, checksum='0c1e02541fe1001a8b041053ababdd53520339ab70055f87e132aee976777392', filename='download')
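The "Got" size (2.36 KiB against an expected 2.63 GiB) suggests that what was downloaded is not the archive itself. One possible explanation, which is an assumption and not confirmed anywhere in this report, is that Google Drive returned a small HTML page instead of the zip file. A minimal sketch for inspecting a downloaded artifact by hand follows. It assumes the checksum in the error is a SHA-256 hex digest (it is 64 hex characters long); `sha256_of_file`, `looks_like_html`, and `diagnose` are hypothetical helper names, not part of TFDS.

```python
import hashlib
import os

# Expected checksum for 300W-LP.zip, copied from the error message above.
# Assumption: it is a SHA-256 hex digest.
EXPECTED_SHA256 = "250c366c417ad48f72522b629ff83dbb2b2a57945803ad4a530a696bb6b13ee7"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def looks_like_html(path, probe_bytes=512):
    """Heuristic: True if the file starts like an HTML page, not a zip."""
    with open(path, "rb") as f:
        head = f.read(probe_bytes).lstrip().lower()
    return head.startswith(b"<!doctype html") or head.startswith(b"<html")


def diagnose(path):
    """Summarize whether a downloaded artifact matches the expected archive."""
    return {
        "size_bytes": os.path.getsize(path),
        "checksum_ok": sha256_of_file(path) == EXPECTED_SHA256,
        "looks_like_html": looks_like_html(path),
    }
```

Calling `diagnose(path)` on the downloaded file (the path is whatever location the error message reports on your machine) returns the file size, whether the digest matches the expected one, and whether the content looks like an HTML page. If `looks_like_html` is true, the problem is likely on the download side, not a corrupted archive.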

Environment information

Reproduction instructions

tfds.load('the300w_lp', with_info=True, split='train')


Link to logs

Expected behavior

The dataset loads correctly, without a checksum error.

Additional context

I am not sure whether the change made in this issue affected this behavior.