Open what-is-what opened 2 weeks ago
How can we get attention weights from example sequence and structure? There were no arguments to get attention weights in transformer blocks, unlike esm2.
also interested in this feature, if available!
How can we get attention weights from example sequence and structure? There were no arguments to get attention weights in transformer blocks, unlike esm2.