google-research / pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Apache License 2.0
857 stars 71 forks source link

Visualization of Attention map #29

Open willxxy opened 1 year ago

willxxy commented 1 year ago

Hello,

Thank you for the wonderful work. I couldn't seem to find the function for visualizing the decoder cross attention map as shown in the paper. Would you be able to provide this?

Thank you, William Han