elijahcole / single-positive-multi-label

Multi-Label Learning from Single Positive Labels - CVPR 2021
https://arxiv.org/abs/2106.09708
MIT License

Question about the label estimator initialization #5

Closed Correr-Zhou closed 3 years ago

Correr-Zhou commented 3 years ago

Hi, Elijah!

Great work! After reading the source code, I have a question about the label estimator initialization.

In your paper, you say that you initialize $\theta_{ni}$ from the uniform distribution on $[\sigma^{-1}(0.4), \sigma^{-1}(0.6)]$ when $z_{ni}$ is unobserved. However, the corresponding operation in models.py is as follows:

# initialize unobserved labels:
w = 0.1
q = inverse_sigmoid(0.5 + w)
param_mtx = q * (2 * torch.rand(num_examples, P['num_classes']) - 1) 

I am not sure whether I have misunderstood this code, or whether it is a mistake.

Looking forward to your reply!

Correr Zhou 2021.8.3

elijahcole commented 3 years ago

Thanks for the question!

I think the code is correct, but there is a non-obvious step in how the lower endpoint arises.

The code initializes the entries of param_mtx to be drawn from Uniform(-q, q).

The upper endpoint is straightforward: q = inverse_sigmoid(0.5 + w).

The lower endpoint comes from applying log properties to the definition of the inverse sigmoid function:

$$-q = -\sigma^{-1}(0.5 + w) = -\log\left(\frac{0.5 + w}{1 - (0.5 + w)}\right) = -\log\left(\frac{0.5 + w}{0.5 - w}\right) = \log\left(\frac{0.5 - w}{0.5 + w}\right) = \sigma^{-1}(0.5 - w).$$

With $w = 0.1$, the two endpoints are exactly $\sigma^{-1}(0.4)$ and $\sigma^{-1}(0.6)$, which matches the paper.
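If it helps, here is a quick numeric sanity check. This is a minimal sketch: the inverse_sigmoid below is defined locally as the logit, log(p / (1 - p)), rather than imported from this repo.

import math
import torch

def inverse_sigmoid(p):
    # logit function: log(p / (1 - p)), the inverse of the sigmoid
    return math.log(p / (1.0 - p))

w = 0.1
q = inverse_sigmoid(0.5 + w)         # sigma^{-1}(0.6), about 0.4055
print(-q, inverse_sigmoid(0.5 - w))  # both about -0.4055, i.e. sigma^{-1}(0.4)

# Draws from Uniform(-q, q), as in models.py, map under the sigmoid into [0.4, 0.6]:
samples = q * (2 * torch.rand(1000, 20) - 1)
probs = torch.sigmoid(samples)
assert 0.4 - 1e-6 <= probs.min() and probs.max() <= 0.6 + 1e-6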

Thanks for flagging this; I've made a note to clarify the code. Please re-open this issue if I didn't answer your question or if I got anything wrong!