Train a screenshot feature extractor and store its weights. Its output is e.g. a 32 dimensional vector.
Train several graph networks that use its weight and experiment with different configurations. The low number of parameters should allow for fast iteration and many experiments.
Things to try:
[ ] Weight sharing between consecutive GN blocks
[ ] Fully-connected vs. original graphs
[ ] Sum instead of avg aggregation
[ ] Conversion of links into two edges, pointing both ways
Things to try: