Open amirziai opened 7 years ago
From the wiki reference, assumed that the probabilities should be normalized by dividing by sum of exp(beta dot x) of other features. Normalization was meant to occur in the next line of code.
Since we have more than 2 classes here there are a few different ways that this can be done:
and the scikit-learn material here: http://scikit-learn.org/stable/modules/multiclass.html
The default implementation is OvR. See if this helps in understanding what's going on: https://gist.github.com/amirziai/c1ead66bc0492a729de0406d2a413651
Is the
pretty_ad_output = np.exp(pretty_ad_output)
line correct?