Balancing dataset - Githubissues

If the entire dataset is used, variables containing 1/1 or 0/1 SeaIceConcentartion will dominate, thus skewing the targetpool of the model. As a counter, try to develop a "belt" of variables along the MIZ to reduce the number of 1s and 0s. The belt can be dynamical, i.e. following the MIZ with a thresholding distance calculated from ice-free areas and fast-ice areas. (Include patch if close to the MIZ and 0.1-0.9, else remove) A static belt, essentially reducing the total area-coverage in the dataset.

AreFrode / MScThesis_AreKvanum2022_SeaIceML

Balancing dataset #9