dorarad / gansformer

Generative Adversarial Transformers

Do we still need bidirectional interaction attention? #24

Closed · arieling closed this issue 2 years ago

arieling commented 3 years ago

Hi Drew, thank you so much for sharing the code!

I notice that none of the checkpoints you released have _AttLayer_n2l_, so are they all simplex + k-means models? I also notice that the numbers reported on GitHub are already a bit better than those in the paper. Does this mean we can safely ignore bidirectional attention for now? Or would it help further and push the Cityscapes FID below 5.23?

dorarad commented 2 years ago

Hi, thanks for reaching out, and apologies for the long delay in responding! The checkpoints are indeed of the simplex model, and they reach slightly better scores than the paper because I kept training them further. I expect bidirectional attention to give a further improvement, but I'm still actively exploring the architecture and ways to optimize it, so for now I've released pretrained models only for the simplex version. Note that you can also train the duplex model yourself using the `--g-img2ltnt` flag. Hope it helps!
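For anyone landing here who is unsure about the terminology, below is a minimal NumPy sketch of the distinction discussed above: simplex attention propagates information in one direction only (latents → image features), while duplex attention adds the reverse img2ltnt step, updating the latents from the image features before the image attends back to them. All names and dimensions here are illustrative assumptions; the sketch omits the learned projections, multi-head splitting, and normalization of the actual GANsformer layers.

```python
import numpy as np

def attention(queries, keys, values):
    """Scaled dot-product attention: each query row aggregates the value
    rows, weighted by softmax similarity between the query and the keys."""
    d = queries.shape[-1]
    scores = queries @ keys.T / np.sqrt(d)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ values

# Toy sizes (assumed): k latent components, n image-grid positions, d channels.
k, n, d = 16, 64, 32
rng = np.random.default_rng(0)
latents = rng.standard_normal((k, d))
image = rng.standard_normal((n, d))

# Simplex attention: one direction only. Image features attend to the
# latents, so information flows latents -> image.
image_simplex = image + attention(image, latents, latents)

# Duplex attention: first the latents attend back to the image features
# (the img2ltnt direction), then the image attends to the updated latents.
latents_updated = latents + attention(latents, image, image)
image_duplex = image + attention(image, latents_updated, latents_updated)
```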