fxnnxc / text_summarization

Apache License 2.0
5 stars 0 forks source link

Generating Z before MultiHead attention #2

Open fxnnxc opened 3 years ago

fxnnxc commented 3 years ago

Because the return size of mu and var is [BATCH x Dimesion], I had to use mean value here. (I tried sum --> grad inf error!) Should I find a better way to generate one vector Z?

https://github.com/fxnnxc/text_summarization/blob/b7a879b1ebbd6e79b590ba9a266772cd5c10e59a/experiments/experiment3/vae_bart_attn_src/models/vae_bart.py#L73-L84

fxnnxc commented 3 years ago

@ntnguyen1234