For the current state of normalization function in utils.py, input could either be a single frame array or stacked array if multiple masks are provided. When more than one mask is provided, normalization step in training will take the percentile based on all pixel values across frames to perform min-max normalization, while for inference step only one image is passed in each time, so the normalization base is different. This needs to be corrected either by modifying the normalization function itself, or in the training script to make it consistent.
For the current state of
normalization
function inutils.py
, input could either be a single frame array or stacked array if multiple masks are provided. When more than one mask is provided, normalization step in training will take the percentile based on all pixel values across frames to perform min-max normalization, while for inference step only one image is passed in each time, so the normalization base is different. This needs to be corrected either by modifying thenormalization
function itself, or in the training script to make it consistent.