Open Wanwannodao opened 7 years ago
prerequisite knowledge Neural Programmer: Inducing Latent Programs with Gradient Descent for set-selection type attention Pointer Networks for attention mechanism Large Scale Distributed Deep Networks Neural Machine Translation by Jointly Learning to Align and Translate for attention mechanism
https://arxiv.org/abs/1611.01578