hrzhang1123 / DTFD-MIL

MIT License
123 stars 19 forks source link

How are there 8.3 million patches in TCGA Lung dataset? #13

Open Dootmaan opened 1 year ago

Dootmaan commented 1 year ago

Hi @hrzhang1123 and thank you for your great work. Recently I have downloaded TCGA-LUAD and TCGA-LUSC datasets and have used the patch tiling code in your repo (level 1, 256x256 patch, 0.8 threshold) but have found that there are only about 3 million patches. I noticed that this TCGA dataset has very few levels and the higher level is 4 times smaller on each dimension than the lower level.

Can you provide some more details about the preprocessing on TCGA dataset?