laiguokun / Funnel-Transformer

MIT License
212 stars 17 forks

Deep VAEs as a research direction for the successor of XLNet #9

Open LifeIsStrange opened 3 years ago

LifeIsStrange commented 3 years ago

https://paperswithcode.com/paper/very-deep-vaes-generalize-autoregressive-1

XLNet is arguably the state-of-the-art language model, and it is autoregressive. I wonder whether the observation that deep VAEs can generalize and outperform autoregressive models in computer vision carries over to language models.

@zihangdai I am posting this here rather than on the XLNet repository because you are not active there.