Closed oleksandr-pavlyk closed 5 years ago
The same results were obtained via sklearn logistic_loss function. Results is correct in provided case (394.4007457386098 and 13609.77269829082 are also not close). But we have unstable results from run to run and currently we investigate this problem.
Hi, Oleksandr Unstable results were fixed for logistic loss objective function. Provided tests on reproducibility were passed.
Describe the bug
Evaluation of loss function and its derivatives is not accurate for logistic loss on scikit-learn's dataset of breast cancer data, with 569 samples and 30 features.
To Reproduce
Using daal4py as interface to DAAL:
and running the script in an environment created with
conda create -n idp_2019.5 -c intel --override-channels python=3.6 scikit-learn daal=2019.5 daal4py=2019.5 scipy numpy
, I get on a Haswell machineNotice that the value of the objective function for two nearby vectors of parameters (L-infinity norm around
5e-3
differ by several orders of magnitude.Expected behavior
Values should be much close to each other.
Environment: