mlcommons / training

Reference implementations of MLPerf™ training benchmarks
https://mlcommons.org/en/groups/training
Apache License 2.0
1.57k stars 548 forks source link

Image Segmentation: Data Preprocessing Verification -- Checksum fails #545

Open nmcglo opened 2 years ago

nmcglo commented 2 years ago

The Image Segmentation (Pytorch UNet3D) benchmark relies on the KITS19 dataset. I've followed the instructions from the KITS19 dataset repository for downloading the dataset and have been trying to run the data preprocessing script (https://github.com/mlcommons/training/blob/master/image_segmentation/pytorch/preprocess_dataset.py)

The cases all pre-process just fine but I get an error when the verify_dataset() function is called. At least one of the cases (Case 00043 specifically) has an md5 checksum hash that does not match the expected checksum value from the mlcommons image segmentation repo (https://github.com/mlcommons/training/blob/master/image_segmentation/pytorch/checksum.json). I haven't exhaustively checked each of them but if I run my own md5 hash on these case files, a random sampling of 10 or so all matched the expected values but the hash for case 43 does not match.

I have downloaded the dataset using both download scripts a total of 7 times and get the exact same invalid checksum each time so it isn't a corrupted download (at least on my end).

mmarcinkiewicz commented 1 year ago

Hi @nmcglohon , is this still a problem for you? I'm going to take a look and try to repro early next week

mmarcinkiewicz commented 1 year ago

I am able to repro. I'll reach out to the dataset owners asking for clarification whether anything has changed.

nmcglo commented 1 year ago

Thanks, apologies for the delay in response - I was away last month.

sepzjh commented 9 months ago

I have the same problem, I get an error when the verify_dataset() function is called,Has this issue been resolved? or can i skip the function?