khundman / telemanom

A framework for using LSTMs to detect anomalies in multivariate time series data. Includes spacecraft anomaly data and experiments from the Mars Science Laboratory and SMAP missions.
https://arxiv.org/abs/1802.04431

Question relating to 'num_values' and rows for each dataset #70

Closed: conroyjb closed this issue 2 years ago

conroyjb commented 2 years ago

I'm confused about the relationship between the number of rows in a given data.h5 file and the number of values in the labeled_anomalies row for that file.

For example, channel P-1 has 2872 rows, but 'num_values' for this channel is 8502, 'num_train_values' is 2612, 'num_test_values' is 8245, and a TP sequence may fall in a range such as '(4520, 4589)'.

Does anyone know how Telemanom converts the rows into these sequences and how that would work exactly? Where would a sequence like '(4520, 4589)' occur in the P-1.npy file?
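
To make the lookup being asked about concrete, here is a minimal sketch. It rests on assumptions that the thread does not confirm: that each sequence is an inclusive (start, end) pair of row indices into the channel's test array, that the sequences are stored as a string such as "[[4520, 4589]]", and that the test array would come from a path like data/test/P-1.npy. The helper `slice_sequence` is hypothetical and not part of Telemanom, and a zero-filled array stands in for the real file so the snippet runs on its own.

```python
import ast

import numpy as np


def slice_sequence(values, seq):
    """Return the rows of `values` covered by an inclusive (start, end) pair.

    Hypothetical helper for illustration; not a Telemanom function.
    """
    start, end = seq
    if not (0 <= start <= end < len(values)):
        raise IndexError(
            f"sequence {tuple(seq)} falls outside an array of length {len(values)}"
        )
    # Assumption: the end index is inclusive, hence the +1.
    return values[start:end + 1]


# Stand-in for np.load("data/test/P-1.npy") (assumed path).
# 8245 is the 'num_test_values' figure quoted above; 25 columns is illustrative.
test_values = np.zeros((8245, 25))

# Assumed storage format for the sequences: a string holding a list of pairs.
sequences = ast.literal_eval("[[4520, 4589]]")

window = slice_sequence(test_values, sequences[0])
print(window.shape)
```

If the array actually loaded has fewer rows than the end index (2872 rows against an end index of 4589, say), the helper raises an IndexError, which would suggest that the sequence indexes a different array than the one being inspected.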