Open roman-bushuiev opened 7 months ago
I haven't had the opportunity to test the algorithm on such large datasets, and it hasn't been optimized for memory consumption. Therefore, it's likely that the memory usage will be similar as PyNNDescent. As for the error you encountered on the AMD CPU, the implementation was developed and tested on Intel chips, so I'm unable to diagnose the specific issue.
Ok, thank you for the information.
Hello! I would like to try out your C++ implementation of NNDescent because the PyNNDescent implementation does not fit into 1.5 TB of memory (my data matrix is 70,000,000 x 1024). Could you advise me if your implementation is more memory-efficient? From the README, I found that it should be faster, but what about memory usage?
Also, I found that the Usage example works fine on an Intel CPU (Intel Xeon Processor (Skylake, IBRS)), but it crashes on an AMD CPU (AMD EPYC 7543 32-Core Processor) with the following error:
Do you know what could be the issue? Thank you in advance!