Dear authors,
NLinear and DLinear only apply additional linear operations (subtracting the last value, and a moving average, respectively) on top of Linear, so neither introduces any non-linearity. We would therefore get the same results if we solved via OLS, and might get slightly different results via GD, since the optimization dynamics may differ due to the different matrix compositions. But it seems the difference wouldn't be large enough to serve as a valid inductive bias for the model (as the results also show). Please correct me if I'm wrong. Thanks.
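
To make the "no non-linearity" point concrete, here is a minimal NumPy sketch, not taken from the repository, that checks numerically that both variants collapse to a single affine map `y = M @ x + b` of the input window. The sizes `L`, `H`, `k`, the random weights, and the replicate-padded moving-average matrix `A` are assumptions made for illustration; the actual implementation may pad or parameterize differently.

```python
import numpy as np

rng = np.random.default_rng(0)
L, H, k = 8, 4, 3  # hypothetical lookback length, horizon, odd moving-average kernel


def moving_average_matrix(L, k):
    """Return the L x L matrix A with A @ x = moving average of x (replicate padding)."""
    pad = (k - 1) // 2
    A = np.zeros((L, L))
    for i in range(L):
        for j in range(i - pad, i + pad + 1):
            # Out-of-range indices are clamped, which mimics replicate padding.
            A[i, min(max(j, 0), L - 1)] += 1.0 / k
    return A


A = moving_average_matrix(L, k)

# Random stand-ins for the learned weights (assumed shapes: H x L weights, H biases).
W, b = rng.normal(size=(H, L)), rng.normal(size=H)
W_s, b_s = rng.normal(size=(H, L)), rng.normal(size=H)
W_t, b_t = rng.normal(size=(H, L)), rng.normal(size=H)


def nlinear(x):
    """Subtract the last value, apply the linear layer, add the last value back."""
    last = x[-1]
    return W @ (x - last) + b + last


def dlinear(x):
    """Split into trend (moving average) and remainder, one linear layer each."""
    trend = A @ x
    seasonal = x - trend
    return W_s @ seasonal + b_s + W_t @ trend + b_t


# Collapse each model into one matrix M and one bias.
e_last = np.zeros(L)
e_last[-1] = 1.0
S = np.eye(L) - np.outer(np.ones(L), e_last)   # S @ x == x - x[-1]
M_n = W @ S + np.outer(np.ones(H), e_last)     # NLinear as a single matrix
M_d = W_s @ (np.eye(L) - A) + W_t @ A          # DLinear as a single matrix
b_d = b_s + b_t

x = rng.normal(size=L)
nlinear_matches = np.allclose(nlinear(x), M_n @ x + b)
dlinear_matches = np.allclose(dlinear(x), M_d @ x + b_d)
print(nlinear_matches, dlinear_matches)
```

The check only shows that each forward pass is an affine function of the input; it says nothing about how GD behaves under the different parameterizations.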