teelinsan / parallel-decoding

Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
https://gladia.di.uniroma1.it/publication/ipi/
Apache License 2.0
108 stars 8 forks source link