Closed raehik closed 1 year ago
@raehik Suspects that this may be due to an issue with how we label data. Hoping that @arthurBarthe can clarify in the next development meeting.
Turns out it's to do with the subdomain regions in training_subdomains.yaml
-- some weren't overlapping with the region of the processed data. lat_min=-80 lat_max=80
works with existing subdomain regions.
Yes, just to clarify a bit more for others, training_subdomains.yaml contains the definitions of the 4 regions used for training (in the paper we do not use the global data for training but only 4 selected regions. We need the processed data to contain those 4 regions, which is why we were getting a bug with lat_min=-25 and lat_max=25. I will add info about training_subdomains in the Readme if it is not there.
Updated default data processing command and added a brief note in readme 6958f5e6b4fbd1a86e2b5a3147b55c6355b64ec8 .
Preparing data with the command from the readme:
Then using the following command to train, again from the readme:
Results in a kernel size mismatch error:
(This is still using
model1
, not the refactored Pytorch model.)