Closed Eldo-rado closed 9 months ago
Hi, thanks for your attention to our work.
1. RSNA Pneumonia: We follow MRM to get the ground truth labels for the test set. The corresponding labels can be accessed here 2. SIIM-ACR Pneumothorax: We do not follow the approach of MRM which utilizes mmsegmentation. Our implement of fine-tune segmentation framework will be released soon. In detail, we only keep positive samples for segmentation (similar to MGCA).
I hope the above info is helpful!
Thank you for your prompt response! I can obtain the classification labels for the RSNA test set here, but where should I get the corresponding segmentation labels/masks?
There is no official RSNA test set segmentation labels released. Therefore, we follow the split of MGCA, randomly split the original training set into 16, 010/5, 337/5, 337 for training/validation/testing
Thank you, I have a general understanding.
During RSNA's classification, you followed the division according to MRM, but it seems inappropriate to directly use the results from the MGCA original paper here, as the divisions between MGCA and MRM are inconsistent. (I have tried using the MRM division on MGCA, and the metrics show some improvement 😂 )
Furthermore, may I inquire about the segmentation metrics at RSNA? The results from MedKLIP do not have specific annotations, it appears that they have not been fine-tuned and are directly taken from the original paper's results. However, I could not find corresponding values in the MedKLIP paper. Could you please clarify how you obtained these results? Perhaps it should also be fine-tuned again, as its division method is also unique.
Thank you for your understanding!
Got it, thank u ;-)
Hi, I apologize for the interruption again. I would like to confirm whether, during the reproduction of KAD in Table 2, the DQN module was set to be frozen? Specifically, was only the last linear layer enabled, or was only the backbone(resnet50) frozen while the other parts remained trainable?
We conduct linear classification with only the last linear layer trainable. We solely load the vision backbone parameters of the official released KAD model, then freeze the backbone.
👋 Hi! Thank you for your contribution, it is really a great job. And I have two questions regarding data processing:
RSNA Pneumonia: In the paper, it is mentioned, "The official data split is followed, with the training/validation/test set consisting of 25,184/1,500/3,000 images, respectively." I checked the RSNA Pneumonia dataset's official website and found that only 25,184+1,500 images have ground truth, and the remaining 3,000 images do not. Where can I find the ground truth for the test set?
SIIM-ACR Pneumothorax: When fine-tuning, is the approach the same as with MRM? Do you still include around 5,000 images without pneumothorax lesions for training, or is the final dataset for training reduced to only around 7,000 images (similar to MGCA), excluding those without lesions?