gregversteeg / CorEx

CorEx or "Correlation Explanation" discovers a hierarchy of informative latent factors. This reference implementation has been superseded by other versions below.
GNU General Public License v2.0

About the paper #10

Closed: fenss closed this issue 3 years ago

fenss commented 3 years ago

Hi, CorEx is great work! I have recently been reading the paper, and I am having trouble understanding the optimization part in Sec. A.

The original objective is like this: [equation image 01]

And because of: [equation image 02]

Then: [equation image 03]

Why can the red-boxed part be pulled out? I think $p(x)p(y|x)$ is different from $p(x_i)p(y|x_i)$. Can you offer more details? Thanks!

gregversteeg commented 3 years ago

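Yes. It might be easier to see it like this:

$$I(X_i;Y) = \sum_{x_i, y} p(x_i, y) \log \frac{p(x_i, y)}{p(x_i)\, p(y)} = \sum_{x, y} p(x, y) \log \frac{p(x_i, y)}{p(x_i)\, p(y)}$$

That is, $p(x_i, y)$ is just a marginal over the full $p(x, y)$. The log term depends only on $x_i$ and $y$, so summing $p(x, y)$ over the remaining components of $x$ gives back $p(x_i, y)$. We use this to pull out the expression in red.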
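The marginalization identity above can also be checked numerically. The following is only a minimal sketch and is not taken from the CorEx code: the toy joint distribution, its shape (two binary inputs `x1`, `x2` and a three-state `y`), and all variable names are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy joint p(x1, x2, y): binary x1 and x2, three-state y.
p_xy = rng.random((2, 2, 3))
p_xy /= p_xy.sum()

# Marginals needed for I(X_1; Y).
p_x1y = p_xy.sum(axis=1)   # p(x1, y), shape (2, 3)
p_x1 = p_x1y.sum(axis=1)   # p(x1),    shape (2,)
p_y = p_x1y.sum(axis=0)    # p(y),     shape (3,)

# The log ratio depends only on (x1, y), not on x2.
log_ratio = np.log(p_x1y / (p_x1[:, None] * p_y[None, :]))

# Left-hand form: sum over (x1, y), weighted by the marginal p(x1, y).
mi_marginal = np.sum(p_x1y * log_ratio)

# Right-hand form: sum over (x1, x2, y), weighted by the full joint p(x, y),
# with the log ratio broadcast along the x2 axis.
mi_full = np.sum(p_xy * log_ratio[:, None, :])

print(mi_marginal, mi_full, np.isclose(mi_marginal, mi_full))
```

Both sums give the same value because summing the joint weights over `x2` inside the second expression reproduces `p_x1y`, which is exactly the step used to pull the red-boxed term out of the sum.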

fenss commented 3 years ago

I understand now, thanks!