Closed eting83646 closed 1 year ago
Hi there. For normal slides we processed them also and used them for cancer classification tasks (we performed cancer classification for BRCA and COAD instead). If I remember correctly, I think here you got 172 cases where in each case there could be multiple WSI images (see the last column of your screenshot). For the typing labels, you may obtain them from the clinical data (which can be obtained from official site). File name like nationwidechildrens.org_clinical_patient_esca.txt.
Thank you for your prompt reply, but I still have some questions remaining.
Thanks in advance.
For all benchmarking datasets and tasks, we use the magnification level of 20 and tissue slides to extract the patches (as diagnostic slides contain annotation labels).
For the ESCA typing labels, we have uploaded our label files under ./data/. It serves as the input of our pipeline for ESCA typing task.
Hi, may I ask how you got the 213 samples for the cancer typing task? Below is a screenshot of what I got when I access the TCGA-ESCA from the GDC portal. After filtering out the slides that are labeled normal, I only got 172 slides to work with. Could you provide the details of how you deal with slides that are labeled normal? Thanks in advance.