Climate-Data-Science / Climate-Similarity-Metrics

Which similarity metrics are the most helpful to understand climate
0 stars 2 forks source link

Compute time delayed dependencies #24

Closed pierretoussing closed 4 years ago

pierretoussing commented 4 years ago

Another possibility is to look for dependencies over time.

pierretoussing commented 4 years ago

In the nature paper, they used the following time shifts:

pawelbielski commented 4 years ago

@pierretoussing What is the interpretation of the delay ranges, e.g. 1-2, 2-4? Does it make sense in our case to define delay in that way?

pierretoussing commented 4 years ago

@pawelbielski The delay ranges in the nature paper mean that they used all of this delays for the similarity computation e.g 2-4 means they used the delays 2, 3, and 4 for this one map. This has something to do with the similarity measures they used: Wavelet multi-scale correlation. This measure is designed for such this purpose. Using standard similarity measures, we would then have to define a method to combine the similarity values for 2, 3 and 4 months in order to combine them into one value for the delay range 2-4 months. So using standard similarity measures (and also implementing the approach Peter suggested), we define the time delays as a simple integer and then shift the whole map by this delay. I used 1, 3 and 12 months as delays. It means that the similarity is computed between the actual reference series and the whole map shifted by the respective time delay. I used the mean as fill value to assure the time series still have the same length (to avoid problems with similarity calculations). This delays act as example, I don't know if they are meaningful or if we should use other delays? I thought that the climate scientists would provide us with the delays they want to look at.

pierretoussing commented 4 years ago

Peter suggested that it would be better to drop points in order to keep the length of the time series equal, than filling up with the mean.