thuml / Corrformer

About code release of "Interpretable Weather Forecasting for Worldwide Stations with a Unified Deep Model", Nature Machine Intelligence, 2023. https://www.nature.com/articles/s42256-023-00667-9
MIT License
157 stars 24 forks source link

Questions about the dataset #2

Closed ChujieXu closed 1 year ago

ChujieXu commented 1 year ago

Many thanks to the authors for opening up the source code and dataset.

I have a question about the dataset. In the Global wind/temp dataset, stations_sorted.npy seems to have sorted the stations. The tree structure of the network is also built directly based on the sorted stations. So if I were to apply Corrformer to other datasets, in what order should stations be fed into the network? Or what preprocessing do I need to do with other station datasets? I really hope to get help from the authors. Thank you very much.

wuhaixu2016 commented 1 year ago

Hi, thanks for your interests.

As stated in our paper, we adopt the hierarchical neighbourhood clustering (Algorithm 1 of Supplementary Information) to organize the stations.

For your own dataset, you need to find out the multiscale structure of stations firstly. Take the tree structure [S1, S2, S3] as the example. You can just view these stations to a plain sequence with size S1 S2 S3.

I think you can get a better understanding after reading this function: https://github.com/thuml/Corrformer/blob/main/layers/Multi_Correlation.py#L292

ChujieXu commented 1 year ago

Thank you for your reply, algorithm 1 in the supplementary information solved my confusion.