Open jimmylao opened 2 years ago
OK. Thanks @jimmylao . But, is it right to mask these zero values in training? specially when data is missing for more than 2 consecutive days. doesn't it give wrong sMAPE in training?
@rangaswamymr Good question.
In summary, based on 1 & 2, it is validate to exclude missing values in loss/error computing since the resultant loss/error tells the performance of true signals. However, if missing value exclusion will affect the distribution of the total population, say: systematically long period of missing values, then these missing values need to be retrieved (inferred) first before training. The training itself is not responsible for fixing missing values.
@ranga Please refer to function called "calc_loss" defined in model.py lines 258 & 259 are used to exclude zero-value ground truth in loss computing during training