Closed jexterliangsufe closed 1 year ago
Hi, thanks for introducing us this challenge problem. This problem involves both imputation and forecasting tasks.
Imputation (1) I think you can try TimesNet (https://github.com/thuml/Time-Series-Library/blob/main/models/TimesNet.py), which is the current SOTA model for time series imputation. (2) Considering the missing rate of this dataset is relative high, I think you can adopt the mean value to fill the missing points at the very beginning and then try the TimesNet for imputation by mask training. (3) If you want to further enhance the imputation performance, you can adopt the TimesNet several times. Each time is to train a new model to correct the last time imputation results.
Forecasting I think both TimesNet and iTransformer are promising.
Good luck to your project.
Thanks for your kind reply! Here are 2 questions.
My dataset contains 100,000+ nodes and each node contains 365 time steps. the missing rate of this dataset is > 0.5. At the mean time, missing is random. Missing is mainly because my target is to forecast a "rate" calculated by A / B and sometimes B = 0. My target is to forecast several future time steps of "rate". I have tried Temporal Fusion Transformer, DLinear and gnn-based spatio-temporal models but achieved so-so results. I would appreciate if you give me some suggestions on dataset preprocessing, missing value inputation and model selection?