Open sungla55guy opened 1 year ago
Thank you for reporting us this bug, we'll fix it soon. Best regards.
Hi,
We have fix this issue, you can try with the new version of shapash 2.3.7
Hello @ThomasBouche , thanks for working on the issue.
I am afraid the issue is still open. I have just faced the same problem using the version 2.3.7.
I guess I understood the problem. The panda DataFrame received as corr
contains NaNs. Thus, pairwise_distances
will results in NaNs only, which triggers the error.
Analyzing the compute_corr
function that generates the corr
matrix we can see that df.corr()
is generating NaNs
du to the presence of constant values (once the standard deviation of a column with constant values is zero, which results in a division by zero in the correlation calculation).
Hello, Do you have an example so that I can reproduce the error? I tried to create an error with constant values, but it didn't create an error.
Furthermore, in the context of a machine learning model, in what cases does a feature have constant values?
Hi! I think I've run into the same issue. It seems to be triggered quite easily when there are a lot of NANs in the dataset. Are there any parameters I can set to skip this step?
Hi I'm using generate report with a LGBMClassifier for a binary classification. My data has categoricals and missing values which lightgbm can handle natively. I'm able to get the dashboard to run however when I try to generate a report with the following code:
I get the following error:
Python version : 3.9.16 Shapash version : 2.3.5 Operating System : Windows 10