bryanwong17 closed this issue 2 hours ago
You can download all diagnostic slides of TCGA-RCC and TCGA-BRCA from GDC, and then look up the slides you need in the CSV files in the numshots folder.
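For anyone who wants a manifest rather than a manual download, here is a minimal sketch of how one might be requested from the GDC API. It is not the authors' procedure. The filter fields (`cases.project.project_id`, `experimental_strategy`, `data_format`), the `return_type=manifest` parameter, the `size` value, and the mapping of TCGA-RCC to the KICH/KIRC/KIRP projects are my assumptions and should be checked against the GDC documentation. The helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Public GDC files endpoint.
GDC_FILES_ENDPOINT = "https://api.gdc.cancer.gov/files"

# Assumption: "TCGA-RCC" is the union of these three GDC projects.
RCC_PROJECTS = ["TCGA-KICH", "TCGA-KIRC", "TCGA-KIRP"]


def build_slide_filter(project_ids):
    """Build a GDC filter selecting diagnostic SVS slides of the given projects."""
    def is_in(field, values):
        return {"op": "in", "content": {"field": field, "value": list(values)}}

    return {
        "op": "and",
        "content": [
            is_in("cases.project.project_id", project_ids),
            is_in("experimental_strategy", ["Diagnostic Slide"]),  # assumed value
            is_in("data_format", ["SVS"]),                         # assumed value
        ],
    }


def manifest_url(project_ids, size=10000):
    """Return the request URL; `size` is an assumed upper bound on matches."""
    query = urllib.parse.urlencode({
        "filters": json.dumps(build_slide_filter(project_ids)),
        "return_type": "manifest",
        "size": str(size),
    })
    return f"{GDC_FILES_ENDPOINT}?{query}"


def download_manifest(project_ids, out_path):
    """Fetch the manifest over the network and write it to out_path."""
    with urllib.request.urlopen(manifest_url(project_ids)) as resp:
        data = resp.read()
    with open(out_path, "wb") as fh:
        fh.write(data)
```

If this works for your setup, calling `download_manifest(["TCGA-BRCA"], "brca_manifest.txt")` should produce a file that can be passed to `gdc-client download -m brca_manifest.txt`.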
Thanks! I was wondering if you have the complete list of all WSIs, including the labels for each dataset (not in the few-shot settings). Thank you!
Sorry, I don't have a complete manifest file because the data was downloaded a long time ago. However, all WSI names and categories used in this article are listed in the tcga_brca.csv, tcga_lung.csv, and tcga_rcc.csv files.
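In case it helps with the label distribution, here is a rough sketch of how the per-class counts could be computed from one of those CSV files. The column name `label` is a guess on my part, since I have not checked the actual headers, so pass whatever name the files really use. `label_distribution` is a hypothetical helper.

```python
import csv
from collections import Counter


def label_distribution(csv_file, label_column="label"):
    """Count rows per class in an open CSV file.

    `label_column` is an assumed header name; adjust it to the real file.
    """
    reader = csv.DictReader(csv_file)
    if reader.fieldnames is None or label_column not in reader.fieldnames:
        raise KeyError(f"column {label_column!r} not found in {reader.fieldnames}")
    return Counter(row[label_column] for row in reader)


def label_distribution_from_path(path, label_column="label"):
    """Convenience wrapper that opens the file at `path` first."""
    with open(path, newline="") as fh:
        return label_distribution(fh, label_column)
```

For example, `label_distribution_from_path("tcga_brca.csv", "label")` would return a `Counter` mapping each category to its number of slides, provided the column name matches.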
Thank you for your answer! I have one more question regarding the number of patches per WSI at 20x magnification. Is it correct that a single WSI can exceed 20k patches? I followed the same settings as the CLAM preprocessing code, but the number of patches seems much higher than what other papers report.
Yes, some WSIs do exceed 20k patches at 20x magnification. I don't think there is anything wrong with your procedure.
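As a quick sanity check on that number, here is a small sketch that counts the positions on a non-overlapping patch grid. The 256-pixel patch and step sizes and the example slide dimensions are assumptions for illustration, not values taken from this repository, and the result is an upper bound because background patches are normally filtered out afterwards.

```python
def grid_patch_count(width, height, patch_size=256, step_size=256):
    """Number of full patches on a regular grid over a width x height image.

    This ignores tissue segmentation, so it is an upper bound on the
    number of patches that would actually be kept.
    """
    if width < patch_size or height < patch_size:
        return 0
    cols = (width - patch_size) // step_size + 1
    rows = (height - patch_size) // step_size + 1
    return cols * rows


# Hypothetical slide that measures 50000 x 40000 pixels at 20x.
print(grid_patch_count(50_000, 40_000))  # 30420
```

So a slide of that size that is mostly tissue can plausibly yield more than 20k patches.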
Got it. Thank you for the confirmation!
Hi, thank you for your great work! I was wondering if you could share the manifest file for automatic download from the terminal for both TCGA-BRCA and TCGA-RCC. Alternatively, could you guide me on how to obtain the file? I would also appreciate it if you could provide the label distribution. Below is an example of the manifest file