Closed tsjain closed 4 years ago
Hello,
All our layers and operations are permutation invariant, so changing the order of the atoms does not change the output.
We do not use positional encodings as in the NLP Transformer. Instead, information about the structure of the molecule is provided to the model through the adjacency and distance matrices. Thanks to this, our Molecule Self-Attention is permutation invariant.
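The point above can be checked numerically. Below is a minimal sketch (not the actual MAT code) of a single-head "molecule self-attention" in which, as described, the attention weights mix the usual softmax term with the adjacency matrix and a distance-derived term; the exact mixing weights `lambdas` and the distance transform `softmax(-D)` are illustrative assumptions. Permuting the atoms, together with the rows/columns of the adjacency and distance matrices, permutes the per-atom outputs accordingly (equivariance), so any permutation-invariant pooling such as a mean gives the same prediction.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def molecule_self_attention(X, A, D, Wq, Wk, Wv, lambdas=(0.5, 0.25, 0.25)):
    """Simplified single-head molecule self-attention (illustrative, not MAT):
    attention weights are a weighted sum of softmax(QK^T / sqrt(d)),
    the adjacency matrix A, and a distance-derived term softmax(-D)."""
    la, lg, ld = lambdas
    d = Wq.shape[1]
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    att = la * softmax(Q @ K.T / np.sqrt(d)) + lg * A + ld * softmax(-D)
    return att @ V

n, f, d = 5, 8, 8                       # atoms, atom features, head dim
X = rng.normal(size=(n, f))             # atom feature matrix
A = (rng.random((n, n)) < 0.4).astype(float)
A = np.maximum(A, A.T)                  # symmetric adjacency matrix
D = rng.random((n, n)); D = (D + D.T) / 2
np.fill_diagonal(D, 0.0)                # symmetric distance matrix
Wq, Wk, Wv = (rng.normal(size=(f, d)) for _ in range(3))

P = np.eye(n)[rng.permutation(n)]       # random permutation matrix

out = molecule_self_attention(X, A, D, Wq, Wk, Wv)
out_p = molecule_self_attention(P @ X, P @ A @ P.T, P @ D @ P.T, Wq, Wk, Wv)

# Per-atom outputs are permutation equivariant: permuting the input
# atoms permutes the output rows the same way ...
assert np.allclose(P @ out, out_p)
# ... so a permutation-invariant readout (e.g. mean pooling) yields
# the same molecule-level prediction for any atom ordering.
assert np.allclose(out.mean(axis=0), out_p.mean(axis=0))
```

The key step is that row-wise softmax commutes with a simultaneous permutation of rows and columns, so the whole attention matrix transforms as `P att P^T`, and `P^T P = I` cancels inside `att @ V`.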
Hi,
Thanks for the really nice and well-explained paper.
I had a question about how the prediction output can be invariant to the order of the atoms in the molecule. One can randomly permute the order of the atoms in the adjacency matrix, the distance matrix, and the atom feature matrix.
Will the MAT give the same property prediction for the different permutations?
My understanding is that the learned attention is between positions, so it is not permutation invariant. In NLP uses of the Transformer, a positional encoding term is added that helps with learning distant context, but unlike in language tasks, the order of the atoms in a molecule can be specified quite arbitrarily.
Thanks.