lucidrains / h-transformer-1d

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning
MIT License
155 stars 21 forks source link

Does its model include relative position embedding? #11

Closed hadaev8 closed 3 years ago

hadaev8 commented 3 years ago

Seems like yes.