google-research / bigbird

Transformers for Longer Sequences
https://arxiv.org/abs/2007.14062
Apache License 2.0
563 stars 101 forks

How is prior art, which can only accept short text input, evaluated on long-text datasets? #25

Open cmd0714 opened 2 years ago

cmd0714 commented 2 years ago

For example, how was Attn-Seq2Seq evaluated?