atomicarchitects / equiformer_v2

[ICLR 2024] EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations
https://arxiv.org/abs/2306.12059
MIT License

Question about Output Head #19

Closed feyhong1112 closed 1 month ago

feyhong1112 commented 3 months ago

I read the paper, but I'm not quite sure what 'Output Head' means. For example, suppose I have a 3D molecular graph of my protein pocket, with each node embedded with features from the BLOSUM62 matrix. Does 'scalar' mean the combination of the node features and the adjacency matrix into a 1D representation, similar to the Graphormer output?

Thank you in advance for your fantastic project and for replying to my simple question.

yilunliao commented 3 months ago

Hi @feyhong1112

Please see my responses below:

  1. What does 'Output Head' mean?

Output heads here refer to the modules that predict labels from the node embeddings; in this work we have energy and force heads. The energy head is a feedforward network (here), and the force head is an attention module (here).
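To make the energy head concrete, here is a minimal sketch (not the repository's actual implementation; the function name, weights, and shapes are illustrative): a per-node feedforward network on the scalar features, followed by a sum over nodes to produce one graph-level energy.

```python
import numpy as np

def energy_head(node_scalars, W1, b1, W2, b2):
    """Illustrative energy head: MLP on scalar node features + sum pooling.

    node_scalars: (num_nodes, num_features) scalar (type-0) features.
    Returns a single scalar energy for the graph.
    """
    h = np.maximum(node_scalars @ W1 + b1, 0.0)  # hidden layer with ReLU
    per_node_energy = h @ W2 + b2                # one scalar per node
    return float(per_node_energy.sum())          # pool over nodes
```

Because the inputs are type-0 (rotation-invariant) features and the pooling is a sum, the predicted energy is invariant to rotations of the input structure, which is the property an energy head needs.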

  2. Does 'scalar' mean the combination of node features and the adjacency matrix into a 1D representation?

No. 'Scalar' means the type-0 vectors of node/edge embeddings (a subset of the equivariant features). The adjacency matrix is how we represent graph structures (e.g., (1) dense arrays with each graph having the same number of nodes/edges after padding as in Graphormer or (2) a flattened 1D array with different numbers of nodes/edges for each graph as in this work) and is not part of the node embeddings.
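To illustrate where the scalar part lives, here is a hedged sketch of one common layout for equivariant node embeddings (the exact memory layout in the repository may differ): for each degree L from 0 to lmax there is a (2L+1, num_channels) block, and the type-0 (scalar) part is the single-row block at L = 0.

```python
import numpy as np

def split_by_degree(embedding, lmax):
    """Split a ((lmax+1)^2, num_channels) embedding into per-degree blocks.

    Assumes blocks are stored in order of increasing degree L, each
    with 2L+1 rows. blocks[0] is the scalar (type-0) part.
    """
    blocks, offset = {}, 0
    for L in range(lmax + 1):
        m = 2 * L + 1
        blocks[L] = embedding[offset:offset + m]
        offset += m
    return blocks
```

Under this layout, "use only the scalar part" simply means feeding `blocks[0]` (shape `(1, num_channels)`) to the energy head and discarding the higher-degree blocks.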

  3. Similar to Graphormer output?

For energy prediction, I think the way we predict energy is similar to how Graphormer does it -- we only use the scalar part of the node embeddings. For force prediction, it is different. More specifically, we use all type-L vectors in the node embeddings to obtain the output type-1 vectors, while Graphormer uses scalars to weight the edge vectors of relative positions.
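The contrast between the two force routes can be sketched schematically (illustrative only; function names and shapes are assumptions, and neither function reproduces the actual modules): an equivariant head directly outputs a type-1 (3D) vector per node, whereas the Graphormer-style route predicts a scalar weight per edge and sums weighted relative-position vectors onto each node.

```python
import numpy as np

def forces_from_type1(type1_embeddings):
    """Equivariant route (schematic): the head's output type-1 vectors
    are already per-node 3D force vectors, shape (num_nodes, 3)."""
    return type1_embeddings

def graphormer_style_forces(receivers, edge_vectors, edge_weights, num_nodes):
    """Graphormer-style route (schematic): scalar weights times
    relative-position edge vectors, summed onto the receiving node."""
    forces = np.zeros((num_nodes, 3))
    for r, vec, w in zip(receivers, edge_vectors, edge_weights):
        forces[r] += w * vec
    return forces
```

Both outputs rotate correctly with the input, but the second route can only produce forces in the span of the edge direction vectors at each node, while an equivariant head with higher-degree features is not restricted this way.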