Open kaushalshetty opened 5 years ago
Prediction in CPU takes about 1 sec for a paragraph. Would using transformer-xl speedup predictions on CPU?
Also interest in this topic.
also interest
Transformer-XL is not bidirectional and the speed-up mainly makes sense for next word prediction.
Also interest
Prediction in CPU takes about 1 sec for a paragraph. Would using transformer-xl speedup predictions on CPU?