szc19990412 / TransMIL

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification
325 stars 72 forks source link

Split rule about TCGA datasets #16

Closed DearCaat closed 1 year ago

DearCaat commented 2 years ago

Hello, Could you please provide more details on generating TCGA datasets? I found on the official website that TCGA-LUAD contains 585 cases and more than 1000 slides, and TCGA-RCC is similar.

szc19990412 commented 1 year ago

Hello, the division of the data set must first ensure that the different WSIs of a patient cannot exist in the training and testing sets at the same time. Therefore, for the TCGA dataset, patients need to be divided.