OpenIDEA-YunanUniversity / ycimpute

A missing value imputation library based on machine learning. It's implementation missForest, simple edition of MICE(R pacakge), knn, EM, etc....
Apache License 2.0
104 stars 18 forks source link

mida算法在iris上的报错 #13

Open utopfish opened 4 years ago

utopfish commented 4 years ago

您好,我使用您提工的方法导入鸢尾花数据集使用mida出现报错: X_filled = MIDA().complete(iris_miss) File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\utils\tools.py", line 86, in complete File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\imputer\mida.py", line 83, in solve File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\imputer\mida.py", line 60, in training ZeroDivisionError: integer division or modulo by zero 使用MissForest出现报错: X_filled =iterforest.MissForest().complete(iris_miss) File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\utils\tools.py", line 86, in complete File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\imputer\iterforest.py", line 106, in solve File "F:\mini_conda\envs\GAT-TSP\lib\site-packages\ycimpute-0.2-py3.6.egg\ycimpute\imputer\iterforest.py", line 121, in _lose_func IndexError: index 69 is out of bounds for axis 0 with size 60 使用其他的数据能正常运行,应该对于一些特定数据无法处理导致的报错

HCMY commented 4 years ago

您好! 第一个问题是您的batch size设置的过大或者数据量级太小导致出现求余为0的情况,您可以把batch size设置小一点,建议增加数据量以更好的训练model。 第二个问题应该是检测连续或者离散数据的时候出现了特殊数据的处理bug,有时间之后会尽快修复!