Closed chakri1804 closed 5 years ago
Thanks!
The formula shown in the print model function is wrong unfortunately when the intercept is negative. The correct formula is:
Pr(Y = +1) = 1/(1 + exp(-(-17 + score))
This should produce probabilities that spread out.
I just pushed an update to fix this issue as well as the other issues in the demos!
Hey,
I have a small doubt regarding the output of the second code (ex_02_advanced_options.py) . After running it, we get a table containing risk scores for the classes selected by the optimizer. In case of the breast cancer dataset, the values under each feature (columns of the CSV) are not binary in nature. In this case, how do we proceed ? cause just multiplying the class weights with values might lead to very high final scores which might result in false probability values for P(Y = 1 | x) (almost 99% for all samples)
Here is a truncated output generated by the code
How do I interpret this in case of non binary data ?
Also, for the same code, the ex_02...py file, you haven't defined what P is. Referring to the ex_03_constraints.py file, I guess this line should be included in the code just after the data has been imported