fac2003 / perceiver-multi-modality-pytorch

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
MIT License
37 stars 4 forks source link

Hierarchical/Multiscale perceiver #7

Closed fac2003 closed 3 years ago

fac2003 commented 3 years ago

An experimental Perceiver architecture variant where each layer can have increasing latent_dim and fewer latents. Similar in concept to the Multiscale Vision Transformers (Haoqi Fan, Bo Xiong, Karttikeya Mangalam, Yanghao Li, Zhicheng Yan, Jitendra Malik, Christoph Feichtenhofer — https://arxiv.org/abs/2104.11227), but adapted to Perceivers. No idea if this can work.