The linear interpolation can cause outliers which will be learned by the model
SST extrapolated over land does not make any sense physically but can be treated as a augmented method for this specific case (since the patterns are linear with the original data). [any research supports the air temperature?]
Q1. If land points are filled with land temperature the gradient will be high, but it may make the model learn the landfall conditions ? (There must be a tradeoff)
Q2. SST is not only the factor , humidity is also takes part here
Issue 1
Method 1
The missing values can be filled using linear extrapolation methods