The computation details of Multi-Head Attention can be found in the "Attention Is All You Need" paper: https://arxiv.org/abs/1706.03762
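For reference, a minimal numpy sketch of that computation (scaled dot-product attention applied per head, then the heads concatenated). The shapes are illustrative assumptions, and the learned projections W^Q, W^K, W^V, W^O that the paper applies around this core are omitted:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(q, k, v, num_heads):
    # q, k, v: (batch, seq_len, hidden); hidden must be divisible by num_heads
    batch, seq_len, hidden = q.shape
    d_head = hidden // num_heads

    def split_heads(x):
        # (batch, seq_len, hidden) -> (batch, num_heads, seq_len, d_head)
        return x.reshape(batch, -1, num_heads, d_head).transpose(0, 2, 1, 3)

    qh, kh, vh = split_heads(q), split_heads(k), split_heads(v)

    # scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
    scores = qh @ kh.transpose(0, 1, 3, 2) / np.sqrt(d_head)
    out = softmax(scores) @ vh

    # concatenate the heads back into (batch, seq_len, hidden)
    return out.transpose(0, 2, 1, 3).reshape(batch, seq_len, hidden)
```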
An example of how to implement the behavior of the MultiHeadAttention operator is given in this ONNX Runtime issue: https://github.com/microsoft/onnxruntime/issues/19924
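As a rough cross-check against that behavior, the sketch above can be exercised for both self- and cross-attention. The shapes below are arbitrary assumptions, and the real contrib op additionally supports masks, bias, and packed inputs, which are not covered here:

```python
rng = np.random.default_rng(seed=0)
hidden, num_heads = 64, 4

# self-attention: query, key and value all come from the same tensor
x = rng.standard_normal((2, 8, hidden)).astype(np.float32)
print(multi_head_attention(x, x, x, num_heads).shape)  # (2, 8, 64)

# cross-attention: the key/value sequence length may differ from the query's
kv = rng.standard_normal((2, 12, hidden)).astype(np.float32)
print(multi_head_attention(x, kv, kv, num_heads).shape)  # (2, 8, 64)
```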
A useful article about Transformers and Attention: https://towardsdatascience.com/transformers-explained-visually-part-1-overview-of-functionality-95a6dd460452

Related MIGraphX pull request: https://github.com/ROCm/AMDMIGraphX/pull/3425