Open jimmy-walker opened 5 years ago
Thank you for your question, and sorry for a delayed reply.
You are right, in theory. In practice, though, the initial "priors" that we add to numerator and denominator help to smooth things out. Also note that the model will never give 100% values for attractiveness, but it may still work well in practice.
Thanks for sharing the code.
When I tested the UBM model, I encountered a question. We all know that UBM uses the EM algorithm to estimate the parameter. For example, for a specific query q and item u, we calculate the attractiveness parameter Aqu. Within the code, we will calculate the numerator and denominator of Aqu separately (An, Ad) and then combine to calculate Aqu = An/ Ad. The problem is that if the first item u1 has only a few click behaviors, its attractiveness parameter may surpass the second item u2 who has many click behaviors. For example : the first item u1: An1=Ad1=100. the second item u2: An2=9000, Ad2=10000 The result of Aqu1 will be bigger than Aqu2.
But I don't think it's normal. Cause the second item get more An. Do you face the similar question during UBM model? Looking forward to any reply. Thanks.