d2l-ai / d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
https://D2L.ai
Other
22.45k stars 4.19k forks source link

A question about 4.7.3.3. Label Shift Correction #2560

Open OneCoin123 opened 9 months ago

OneCoin123 commented 9 months ago

I'm confused about the equation $\sumjc{ij}p(y_j)=\mu(\hat y_i)$ and the definition of confusion matrix $C$ above. As I understood, the equation is based on the full probability equation $$\sum_jP(\hat y=y_i|y=y_j)P(y=y_j)=P(\hat y=y_i)$$ where $\hat{y}$ stands for the predicted label of $x$ and $y$ stands for the true label of $x$. To link the two equation together, I got $P(\hat y=y_i)$ is equal to $\mu(\hat y_i)$ and $P(y=y_j)$ is equal to $p(yj)$. So the confusion matrix element $c{ij}$ need to be the conditional probability, while according to the definition above, the $c_{ij}$ is actually a joint probability drawn from training distribution. My question is

Looking forward to your reply!

OneCoin123 commented 9 months ago

Sorry that I didn't put the question in the forum. There was something wrong when I tried to put the question in the forum at discuss.d2l.ai.