Xiyue-Wang / TransPath

GNU General Public License v3.0
245 stars 31 forks source link

Different slide numbers for TCGA-KICH/KIRC/KIRP in TCGA website and TCGA-RCC you mentioned in the paper #30

Open Shentl opened 1 year ago

Shentl commented 1 year ago

Hi, @Xiyue-Wang

In paper, we mentioned that "TCGA-RCC is a subset of TCGA for the classification of three subtypes of (TCGA-KICH), (TCGA-KIRC), and (TCGA-KIRP). There are a total of 884 FFPE WSIs, including 111 KICH WSIs, 489 KIRC WSIs, and 284 KIRP WSIs."

But in the TCGA webiste, TCGA-KICH has 121 slides, TCGA-KIRC has 519 slides, TCGA-KIRP has 300 slides (seach diagnostic slides). I found that KIRC/KIRP/KICH in TCGA website have more slides in TCGA-RCC you mentioned in the paper, and I found the same problem for the TCGA-NSCLC dataset.

So, is TCGA-RCC just the simple combination of TCGA-KICH/KIRC/KIRP, or it is another public dataset that chooses some slides in TCGA-KICH/KIRC/KIRP?

Xiyue-Wang commented 9 months ago
  1. TCGA-RCC is the simple combination of TCGA-KICH/KIRC/KIRP,we may have downloaded it and missed some pieces because of the internet