Open Chan-Hee opened 2 years ago
If you want to keep it iid, then the (8.16) should be represented as joint likelihood that factorizes into conditional likelihood of y|x and marginal likelihood of x
We consider a supervised learning setting. Here, x is not a random variable, but a deterministic input. I think i.i.d. is fine
Describe the mistake We assume that set of examples (x_1,y_1) ,...,(x_N,y_N) are "independent and identically distributed" is wrong statements.
Location Please provide the
Proposed solution It should be independent only. Otherwise, p(y_n|x_n,\theta) doesn't make sense. The notation p(y_n|x_n,\theta) means that conditional distribution of y is linked with x.(as x changes conditional dist of y changes) Unless all x_1, ..., x_n has same values it should be independent only.
Additional context Add any other context about the problem here.