Closed roman-mishchenko closed 1 year ago
@FabianIsensee would be the expert on this. Any ideas?
The baseline is trained as a 5-fold cross-validation. So when reproducing the results you need to respect the splits: you need to identify which cases where in the validation set of fold 0 and run prediction on those with fold0 only, then move on to fold1 etc. You cannot just run nnUNet_predict beause then you are essentially predicting training data Best, Fabian
Got it, and after the merge of predictions of all 5 folds, run evaluate_predictions
yes, exactly
Hi, I've noticed that my results with Baseline Model (Described here) are inconsistent with those described in this repository.
I used the unchanged KiTS21 Dataset and ran predictions with the 3d_fullres Baseline model without changes in weights. However, after running
sample_segmentations
andevaluate_predictions
, I got different results that are described in this repository and paper. All source code I left unchanged (that counts for seeds for randomization).Do you know any particular reason why this can happen?