Open jeng1220 opened 4 years ago
after investigation, this problem was caused by old version of CUB, which referred by setup instructions (https://github.com/amzn/amazon-dsstne/blob/master/docs/getting_started/setup.md#cub-setup)
version 1.5.2 is too old for Tesla V100. could anyone update setup instructions?
So first problem is that I do not believe anyone is left at Amazon to accept pull requests. But since you seem to work at NVIDIA and I do too, we could at least maintain my fork of it. I am surprised it's not working under CUDA 10 because I have had no issues building for Turing. But I thought we already handled this correctly because it was an issue as of CUDA 9? Also where is the CUB version in the source? I am betting my local version is newer than 1.5.2 and that might have obscured this for some time.
this line also uses old shuffle API, which needs to be fixed https://github.com/amzn/amazon-dsstne/blob/master/src/amazon/dsstne/knn/topk.cu#L141
it threw:
the old CUDA shuffle API have been deprecated in CUDA 9.0, and not available after CUDA 10 (link)