Yanzhou-Jin closed this issue 5 months ago
I think it is fairly apparent that when h5py throws an error, the corresponding file is likely corrupted. I don't think we should add an integrity check for every dataset, since we now support a great number of them.
The terminal was accidentally killed by an external interrupt while running:
./ch train jsc-tiny jsc --max-epochs 10 --batch-size 256
After that, the code has the following issues:
This can be fixed manually by removing the file inside the hidden folder './.machop_cache/dataset'. The programme did not report that the file was incomplete. It might be a good idea to check the integrity of the dataset before opening the files.
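A rough sketch of what such a check could look like, without adding per-dataset logic. The helper name `open_cached` and its `opener` argument are hypothetical and not part of the existing code. It assumes the loader raises `OSError` on an unreadable file (which, as far as I know, is what h5py does when it cannot open a truncated file) and that a cached file which fails to open can safely be deleted and regenerated.

```python
from pathlib import Path


def open_cached(path, opener):
    """Open a cached dataset file, evicting it if it cannot be read.

    `opener` is any callable that takes a path and raises OSError on an
    unreadable file, for example `lambda p: h5py.File(p, "r")`.
    """
    path = Path(path)
    try:
        return opener(path)
    except OSError as err:
        # Assumption: a file that fails to open was left incomplete by an
        # interrupted run, so it is safe to delete and regenerate.
        path.unlink(missing_ok=True)
        raise RuntimeError(
            f"Cached dataset file {path} looks corrupted and was removed; "
            "re-run the command to regenerate it."
        ) from err
```

With something like this, the user would get a clear message pointing at the cache instead of a raw h5py error, and the broken file would already be gone on the next run.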