Open xuyxu opened 4 years ago
It looks like that this problem happens if all training samples take the same value on a dimension. I would like ask that what is the suggested solutions on handling such datasets, is simply removing the dimensions with same values appropriate ?
Hi, thank you for sharing the codes. I encountered one problem when using PIDForest on ionosphere dataset, which is also publicly available (http://odds.cs.stonybrook.edu/ionosphere-dataset/)
Concretely, my problem belongs to the setting of clean training data (i.e., no anomalies in the X_train), and the goal is to evaluate X_test mixed up with both inliers and anomalies.
After calling
forest.fit(X_train)
, one warning appears:No entropy in dimension : 0
, and the program goes on. However, when callingforest.predict(X_test, ...)
, PIDForest returns one error:Below are the codes that reproduce the problem. Can anyone help me to solve this problem? Thanks a lot !