Open ashu5644 opened 5 years ago
+1
https://github.com/tensorflow/tensor2tensor/commit/e5e7d4babf9c57d943a12f10124439fc50d5e2d5 - '''Transformer with memory in the style of Transformer-XL'''
afaik it doesn't support gpu out of the box right now, but that's nothing that cant be hacked around :P
@mark-radai, Thanks for the answer !
+1
+1..though it seems possibly already implemented, it's not the easiest to decrypt how precisely to use it without a clear example or documentation.
Is there support for transformer xl type data pipeline in current tensor2tensor version?As data pipeline of xl will be much different from normal one.
Is there any model based on or supporting Transformer-XL architecture in tensor2tensor? Architecture is referenced with paper: https://arxiv.org/abs/1901.02860 Title: Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context