ncoudray / DeepPATH

Classification of Lung cancer slide images using deep-learning
489 stars 212 forks source link

The problem of images ,corresponding JSON and xml files download #39

Closed byebaibai closed 5 years ago

byebaibai commented 5 years ago

Hi, sorry to interrupt.

As shown in the first image, I have tried a lot to download the file from https://portal.gdc.cancer.gov/legacy-archive/search/f.

屏幕快照 2019-07-24 下午2 54 09

When I try to test the code, 0.1 Tile the svs slide images works well, but it seems that 0.2 Sort the tiles into train/valid/test sets according to the classes defined fails, which is shown in the second image.

屏幕快照 2019-07-24 下午3 09 04

I think I downloaded the wrong images, xml, and JSON files when the website has changed a lot.

Could you help me to find the data you use in this program?

Thanks a lot for your kindness!

ncoudray commented 5 years ago

Hi - It don't understand your issue: to me, it looks fine: it looks like you downloaded only 2 svs images, so when it sort them, it assigned them to the valid set, but if you want something in the train and test, you need to download more than 2 images. You can do it by downloading metadata and then using their gdc-client, it's faster than doing them one by one.

HTH, Nicolas