Improvement idea: Just using numerical variables for graph deviation scoring?

d-ailin / GDN

Implementation code for the paper "Graph Neural Network-Based Anomaly Detection in Multivariate Time Series" (AAAI 2021)

MIT License

481 stars 141 forks source link

Thanks for your interest. I think it is an interesting question, and it could probably be divided into two sub-questions:

Would it be better to use some different loss or specialize some other operations for categorical variables?
Should we just use the predictions of the numerical variables for the scoring?

For Q1, I think it would be yes if more detailed consideration is taken in the dealing with different types of variables/data. For Q2, I am not quite sure about it, as in some real-world cases, there could exist some abnormal case that only happens in some categorical variables, only using the predictions of the numerical variables could incur False Negatives.

d-ailin / GDN

Improvement idea: Just using numerical variables for graph deviation scoring? #57