noise phenomenon is present in the Sentinel-1 ESA Instrument Processing Facility (IPF) v2.9 SAR data, particularly in subswath transitions, visible as long vertical lines and grained particles resembling small sea ice floes
U-Net CNN architecture plus symetrical blocks of convolutional, pooling and upsampling layers in the encoder and decoder of the U-Net
There may be a trade-off between the level of detail and homogeneity—a larger receptive field increases the homogeneity of SIC predictions, but it also appears to reduce the level of detail in predictions
landfast ice predictions are still troublesome for the model
a suggestion in the literature is to increase the receptive field of the model, allowing it to predict a value of the output pixel, based on the information from a wider area in the input image.
preprocessing: Six scenes containing errors in the ice charts depicting open water as 100% sea ice are removed and scenes without sea ice are discarded to balance class distribution
testing: 23 scenes, deemed difficult by professional ice analysts, have been selected for testing.
class imbalance: how are the classes of sea ice concentration distributed? -> median frequency weighting scheme, weighted cross-entropy loss
In this study, SIC estimation is formulated as a classification problem with the (weighted) categorical cross-entropy as the loss function, which represents the dissimilarity between real distribution of labels and output distribution predicted by the model - instead of regression problem
there are random data augmentations of the dihedral group (rotation etc.) and affine transformations
training of about 80–90 epochs took approximately 22–24 h per model on two Nvidia TeslaV100 SXM2 32-GB graphics cards
instead of symmetrically doubling and halving the filters for each level, 16 in the initial and final levels, and 32 in the remaining levels are used. To both simplify and minimize the risk of overfitting to ambiguous SAR textures.
receptive field of the U-Net models is thus 44, 92, 188, 380, 764, 1532, and 3068 for 2–8 levels
The best performing model is number 10 with eight levels, a receptive field of 3068 pixels, and trained on NERSC noise correction, with a patch size of 768 × 768 with an overall R 2 -score of 86.34%
There may be a trade-off between the level of detail and homogeneity
[AI4Artic challenge dataset manual]()
dataset description: the images are not equally distributed through the seasons, the incidence angle of the sensor affects the amout of radar backscatter
the texture is very important for the image classification
the sea maps are done via polygons, in which each individual grid cell may deviate substantially, but in average it is ok (-> smoothing of the result)
the regions near the ice edge are most important, and the ice maps there are more accurate and more detailed
the inter-analysts differences show a up to 20% deviation, especially in the intermediate concentrations
distance to land is especially useful to mitigate land spill-over effects at the coastal areas
NaN values are 2, non-data or masked pixels are 255
attention u-net: can provide localized classification information as opposed to global classification
inception u-net (!): filter of multiple sizes on the same layer for images with large variations in shapes and sizes in the salient region. Can read high-level details across a spectrum of sizes and shapes without going too deep
recurrent u-net (!): can update the feature maps based on context from adjoining units
dense u-net: can have fewer channels as information is preserved btw layers
ensemble u-net: first high-level segmentation and then successive u-nets performing segmentation on smaller objects
The research group of the challenge tells us: