pytorch / text

Models, data loaders and abstractions for language processing, powered by PyTorch
https://pytorch.org/text
BSD 3-Clause "New" or "Revised" License

One of the three datasets returned by Multi30k seems to be bugged. #2215

Open mcxiaoxiao opened 9 months ago

mcxiaoxiao commented 9 months ago

🐛 Bug

This looks like the same problem as https://github.com/pytorch/text/issues/2001.

I did what the error message suggests: I deleted the cached files manually and retried, but the same error occurs.

To Reproduce

Steps to reproduce the behavior:

Paste the following into a new Python file and run it.

The only difference from that issue is that the dataset is now Multi30k.

RuntimeError: The computed hash e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855 of /root/.torchtext/cache/Multi30k/training.tar.gz does not match the expectedhash 20140d013d05dd9a72dfde46478663ba05737ce983f478f960c1123c6671be5e. Delete the file manually and retry.
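One detail in the message may help narrow this down: the computed hash, e3b0c442...b855, is the SHA-256 digest of zero bytes of input. That suggests (but does not prove) that the cached training.tar.gz is an empty file, which would also explain why deleting it and retrying produces the same error if every download attempt yields an empty file. The sketch below uses only the Python standard library; the helper name `sha256_of_file` is hypothetical and not part of torchtext, and the empty temporary file stands in for the real cache file.

```python
import hashlib
import os
import tempfile

# Hash reported as "computed" in the RuntimeError above.
REPORTED_HASH = "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of the file at `path`, read in chunks.

    Hypothetical helper for illustration; not a torchtext function.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA-256 of empty input, computed directly.
EMPTY_HASH = hashlib.sha256(b"").hexdigest()
print("reported hash is the empty-input hash:", EMPTY_HASH == REPORTED_HASH)

# Stand-in for the cached archive: an empty temporary file.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    empty_path = tmp.name
try:
    empty_file_size = os.path.getsize(empty_path)
    empty_file_hash = sha256_of_file(empty_path)
finally:
    os.remove(empty_path)

print("size of stand-in file:", empty_file_size)
print("stand-in file hash matches reported hash:", empty_file_hash == REPORTED_HASH)
```

To check the actual cache, call `os.path.getsize` and `sha256_of_file` on the path from the error message, /root/.torchtext/cache/Multi30k/training.tar.gz, before deleting it. A size of 0 would confirm that the download itself is failing rather than the file being corrupted on disk.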