yzh119 / BPT

Source code for the paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
MIT License
125 stars, 20 forks