Closed wptmdoorn closed 6 years ago
Thanks for the feedback :)
Looking at the info you provided, my assumption is that you copied _criterion.pyx to the sklearn folder of your main Python environment, while you are actually working in a virtual environment.
Please let me know if it works for you
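A quick way to check that assumption is to ask the interpreter you actually run which environment it belongs to and where it would load sklearn from. This is only a minimal sketch: the helper names package_dir and describe_environment are made up for illustration, and it uses nothing but the standard library.

```python
import importlib.util
import os
import sys


def package_dir(name):
    """Return the directory a top-level package would be imported from, or None."""
    spec = importlib.util.find_spec(name)  # locates the package without importing it
    if spec is None or spec.origin is None:
        return None
    return os.path.dirname(os.path.abspath(spec.origin))


def describe_environment(name="sklearn"):
    """Collect the interpreter path, environment prefix and package location."""
    return {
        "interpreter": sys.executable,     # the python binary that is running
        "prefix": sys.prefix,              # root folder of the active environment
        "package_dir": package_dir(name),  # where `name` would be loaded from
    }


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

If the printed package_dir is not the folder the file was copied into, the file went to a different environment than the one being used.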
It's one of those stupid things you always overlook when you have looked into basically everything else. It is working now, so I will close this issue - thank you!
Please let me know if you ever need any help with (additional) testing, I would be more than willing to help you out!
Sure, it happens to me all the time :)
Thanks a lot for the offer!
Actually it would be very interesting to know if this algorithm brings better results in your modeling use case. But in case you're working on something that can't be publicly shared I totally understand.
For sure I would be willing to share some results! I am not sure I can share everything, but for sure enough for it to make any sense ;)
I am dealing with a dataset of about 70k rows containing ~220 variables (depending on how many we select). The most interesting part is that about 80% of the variables are missing for each row (very heterogeneous). The output is binary and the class imbalance is only about 1:16 (not that extreme). I strongly believe that the Hellinger distance would bring better results. I will try to deliver you some data soon :)
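For context on why Hellinger distance is attractive with imbalanced classes, here is a rough sketch of the usual two-class definition for a binary split. It is plain Python for illustration only, not the Cython code from the repository, and the function name hellinger_split_distance is made up. Each class is normalised by its own total, so the class ratio itself does not enter the value.

```python
import math


def hellinger_split_distance(left_counts, right_counts):
    """Hellinger distance between the two class distributions over a binary split.

    Each argument is a (negatives, positives) pair of sample counts in that branch.
    """
    neg_total = left_counts[0] + right_counts[0]
    pos_total = left_counts[1] + right_counts[1]
    if neg_total == 0 or pos_total == 0:
        return 0.0  # only one class present, nothing to separate
    total = 0.0
    for neg, pos in (left_counts, right_counts):
        # compare the fraction of each class that falls into this branch
        total += (math.sqrt(pos / pos_total) - math.sqrt(neg / neg_total)) ** 2
    return math.sqrt(total)


if __name__ == "__main__":
    # Illustrative counts only, roughly 1:16 positives to negatives.
    print(hellinger_split_distance((120, 2), (40, 8)))
```

Multiplying all negative counts by the same factor leaves the result unchanged, which is the skew insensitivity that makes this criterion interesting here.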
Interesting use case, hope Hellinger will bring you some added value. Thanks for sharing!
Hello,
I have Anaconda3 on my Windows system. Sklearn is installed both in Lib in Anaconda's location and in 'envs'->'[My envs name]'->'Lib'->'site-packages'. I followed the instructions on https://github.com/EvgeniDubov/hellinger-distance-criterion. So, first I cloned 'hellinger-distance-criterion' somewhere on my system. Second, I took _criterion.pxd and replaced my original _criterion.pxd with it in both the main and the virtual environment. Then I opened the 'hellinger-distance-criterion' folder in Anaconda's console and built the module using python setup.py build_ext --inplace. Everything looked normal and no error was shown. But when I open Spyder or PyCharm and run from hellinger_distance_criterion import HellingerDistanceCriterion, it shows "ModuleNotFoundError: No module named 'hellinger_distance_criterion'". Would you please let me know what the problem might be and how to solve it? Thank you
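One possible cause, offered as an assumption rather than a diagnosis: build_ext --inplace leaves the compiled module inside the cloned folder instead of installing it into site-packages, so the import only works when that folder is on sys.path (or is the working directory) of the interpreter that Spyder or PyCharm uses. The sketch below shows one way to check this. The helper names and the clone path are hypothetical placeholders.

```python
import importlib
import importlib.util
import os
import sys


def make_importable(folder):
    """Put `folder` at the front of sys.path if it is not already listed."""
    folder = os.path.abspath(folder)
    if folder not in sys.path:
        sys.path.insert(0, folder)
    return folder


def can_import(module_name):
    """True if Python can locate `module_name` on the current sys.path."""
    importlib.invalidate_caches()  # pick up files created after startup
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Hypothetical location of the clone; replace with the real folder.
    clone_dir = r"C:\path\to\hellinger-distance-criterion"
    make_importable(clone_dir)
    print("hellinger_distance_criterion importable:",
          can_import("hellinger_distance_criterion"))
```

If this still prints False, it is worth checking that the IDE runs the same environment in which the build was done.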
Hi Evgeni,
Thank you for making this publicly available first and foremost! Also, good luck on your last efforts to implement this into the imblearn package - that is a great effort!
I have been trying to install your package, but so far I have not succeeded. Could you please look into my issue? The steps I undertook:
I am receiving the following error (just the first part; what follows is mainly declaration errors, which are a logical consequence of these first ones):
It seems it cannot find the files supplied. So I re-checked, and also added the tree.pxd and tree.pyd files from the original sklearn repository, but this did not fix any of the problems. An overview of my sklearn/tree/ directory:
Would you have any idea what is going on here?
Thanks a lot in advance!