NVIDIA-Merlin / models

Merlin Models is a collection of deep learning recommender system model reference implementations
https://nvidia-merlin.github.io/models/main/index.html
Apache License 2.0
262 stars 50 forks

Adding CrossAttentionBlock #1193

Closed marcromeyn closed 1 year ago

marcromeyn commented 1 year ago

Goals :soccer:

This PR adds `CrossAttentionBlock`. It is a better fit for session-based models than broadcasting context features to the sequence, since broadcasting leads to a lot of redundant computation. Cross-attention is also a core building block for multi-modal models like Flamingo:

image
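For readers unfamiliar with the mechanism, here is a minimal pure-Python sketch of single-head scaled dot-product cross-attention. It is not the code from this PR: the function names, the toy vectors, and the omission of learned query/key/value projections are all assumptions made for illustration. The point it shows is that the context vectors are kept once at their own length, and every sequence position attends to them, rather than the context being copied to every position.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, context):
    """Hypothetical helper: single-head cross-attention, no learned projections.

    queries: list of n vectors, e.g. one per item in a session
    context: list of m vectors of the same dimension, e.g. embedded context features
    returns: list of n vectors, each a weighted average of the context vectors
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    outputs = []
    for q in queries:
        # Similarity of this sequence position to every context vector.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in context]
        weights = softmax(scores)
        # Mix the context vectors (used as values) with the attention weights.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, context)) for j in range(d)]
        )
    return outputs


# Toy data (assumed): a session of 3 items and 2 context vectors, dimension 2.
session = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context = [[2.0, 0.0], [0.0, 2.0]]
attended = cross_attention(session, context)
```

The output has one vector per session item, while the context stays at length 2 throughout. A real block would typically add learned projections, multiple heads, and masking for padded positions.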

github-actions[bot] commented 1 year ago

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1193