Closed JanBenes closed 8 years ago
I believe you are doing everything correctly; I don't have a lot of testing on Windows and that has been causing me some problems. As far as I can tell, the issue is essentially that the internals of the sklearn kd-tree are stored as int32s and I'm expecting int64s, which in turn I suspect stems from your 32-bit install. I'm not sure there's an easy fix for that beyond an extensive code refactor that I've been putting off, but apparently need to get around to.
In the meantime you can do the following: whenever you instantiate an HDBSCAN object, be sure to pass algorithm='prims_kdtree' for the Euclidean metric or algorithm='prims_balltree' for other metrics (except precomputed, which should work fine as is). This runs a slower algorithm that should work on your system. I'll try to get the more major code overhaul made and let you know when I'm done.
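As a minimal sketch of the workaround described above: the helper below only builds the keyword arguments, so it runs without hdbscan installed. The name workaround_kwargs is hypothetical and not part of hdbscan; only the two algorithm strings and the precomputed exception come from this thread.

```python
def workaround_kwargs(metric="euclidean"):
    """Build HDBSCAN keyword arguments that apply the workaround.

    Hypothetical helper, not part of the hdbscan API.
    """
    kwargs = {"metric": metric}
    if metric == "precomputed":
        # Precomputed distances should work as is, so no override is added.
        return kwargs
    if metric == "euclidean":
        kwargs["algorithm"] = "prims_kdtree"
    else:
        kwargs["algorithm"] = "prims_balltree"
    return kwargs


if __name__ == "__main__":
    for name in ("euclidean", "manhattan", "precomputed"):
        print(workaround_kwargs(name))
```

With hdbscan installed, the result can be passed straight to the constructor, e.g. hdbscan.HDBSCAN(**workaround_kwargs("manhattan")).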
I just checked in a fix for this (hopefully -- I have no relevant test system available). If you could clone the repository and check if it now works for you I would be grateful!
Now on PyPI as version 0.6.1. Closing for now; please reopen if the problem persists.
The algorithm='prims_kdtree' workaround works for me. sklearn and matplotlib print the following warnings:
C:\Python27\lib\site-packages\sklearn\lda.py:4: DeprecationWarning: lda.LDA has been moved to discriminant_analysis.LinearDiscriminantAnalysis in 0.17 and will be removed in 0.19
"in 0.17 and will be removed in 0.19", DeprecationWarning)
C:\Python27\lib\site-packages\sklearn\qda.py:4: DeprecationWarning: qda.QDA has been moved to discriminant_analysis.QuadraticDiscriminantAnalysis in 0.17 and will be removed in 0.19.
"in 0.17 and will be removed in 0.19.", DeprecationWarning)
C:\Python27\lib\site-packages\matplotlib\lines.py:1107: UnicodeWarning: Unicode unequal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
if self._markerfacecolor != fc:
I do not need that fixed, just thought you might want to know.
Thanks a lot for the very prompt fix :+1:, I appreciate it. I'll be happy to run some tests on Windows for you if needed; just let me know if you're interested.
Hi,
I tried running the plot_hdbscan.py example, but it failed with an error:
I am not really sure how to proceed now. I think it might be a configuration issue, but the only thing that I think might be relevant is that I had to manually download and install VCForPython27.msi, as instructed by pip, and that I had to manually install cython, as pip install hdbscan kept failing with a cython-related error and I figured that might be the issue. I remember reading that cython has to use the same version of the C/C++ compiler that was used to compile Python itself, but I'm not sure how to verify that this is indeed the case (Python seems to have used MSC v.1500 32 bit). I can only assume pip pointed me to the right distribution, i.e. VCForPython27.msi.
I'm on Windows 10, Python 2.7.10 (default, May 23 2015, 09:40:32) [MSC v.1500 32 bit (Intel)] on win32, I have MSVC 2015 installed (if relevant), and pip freeze reports:
which exceeds your requirements.txt. Numpy is built with MKL, and all libraries were installed either through pip or from Christoph Gohlke's binaries (http://www.lfd.uci.edu/~gohlke/pythonlibs/).
Any other ideas as to what might be wrong? Thanks!
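For anyone checking the same details, here is a small sketch using only the standard library. It reports which compiler built the running interpreter and whether the interpreter is 32-bit or 64-bit. The function name interpreter_info is made up for illustration, and the sketch does not check which compiler cython or pip actually invokes.

```python
import platform
import struct
import sys


def interpreter_info():
    """Collect compiler and word-size details of the running interpreter."""
    # Size of a C pointer in bits: 32 on a 32-bit build, 64 on a 64-bit build.
    pointer_bits = struct.calcsize("P") * 8
    return {
        "version": platform.python_version(),
        # Same compiler string the interpreter shows in its startup banner.
        "compiler": platform.python_compiler(),
        "pointer_bits": pointer_bits,
        "is_64bit": sys.maxsize > 2 ** 32,
    }


if __name__ == "__main__":
    for key, value in sorted(interpreter_info().items()):
        print("%s: %s" % (key, value))
```

On a 32-bit install such as the one described here, pointer_bits would be 32, which matches the int32 versus int64 mismatch suspected in the reply.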