NELSONZHAO / zhihu

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
https://zhuanlan.zhihu.com/zhaoyeyu
3.5k stars 2.14k forks source link

博主这个模型结构内,In18 里有一个定义全局网络层的操作,是为了保证每一次解码的时候Attention参数都是共享的吗,我自己照着写了一个,当时看这一步觉得有点多余就没做定义,最后发现每一个时序的Attention参数是不共享的了。。。求指点哦~ #38

Open ArnoTeng opened 4 years ago