Closed Gori-LV closed 3 years ago
Thanks for your questions.
We did not use MUTAG dataset in our SubgraphX paper. For all molecular datasets, the initial node embeddings are one-hot vectors instead of original node label. Then using zero-paddings for a node will remove the node information and not contribute to its neighbor's information aggregation.
We believe simply removing the nodes is not a good choice since graph data are very sensitive to the structural changes.
I see, very innovative, thanks a lot!
In SubgraphX, it is very inspiring that zero-padding is introduced.
I have the same observation that GNN is very very sensitive to topology perturbation. Thus I have some questions:
Regarding setting features to be 0 - in MUTAG dataset, I suppose the default node feature 0 represents the atom is Carbon, do you set feature value to 0 in zero-padding part in a way that they actually all turn into Carbon or any other atom? (but after I think it through, I think you mean setting features all to zero given the node label is encoded in a one-hot way, such that setting zeros will deactivate the weights for node features, correct me if I'm wrong.)
Compared with simply removing the nodes from the graph, how does zero-padding affect the model’s prediction in empirical study?
Thanks!