Closed kexul closed 2 years ago
I don't think this has anything to do with periodic boxes. It's throwing a CUDA error:
RuntimeError: Encountered error cudaErrorNotSupported at /home/conda/feedstock_root/build_artifacts/nnpops_1658858275941/work/src/ani/CudaANISymmetryFunctions.cu:43
That indicates it's performing an operation that isn't supported by your GPU or driver. Line 43 of CudaANISymmetryFunctions.cu is
CHECK_RESULT(cudaMallocManaged(&positions, numAtoms*sizeof(float3)));
Managed memory has been supported by all GPUs for quite a long time. What GPU and driver do you have?
Many thanks for spotting where the true problem is! @peastman
I just test the cudaMallocManaged
function using this snippet and it throws the cudaErrorNotSupported
again.
#include <cuda.h>
#include <cstring>
#include <stdexcept>
using namespace std;
#define CHECK_RESULT(result) \
if (result != cudaSuccess) { \
throw runtime_error(string("Encountered error ")+cudaGetErrorName(result)+" at "+__FILE__+":"+to_string(__LINE__));\
}
int main() {
int32_t *A;
CHECK_RESULT(cudaMallocManaged((void**)&A, sizeof(int32_t)));
return 0;
}
terminate called after throwing an instance of 'std::runtime_error'
what(): Encountered error cudaErrorNotSupported at test.cu:14
I'm using a virtualized GPU in a docker container environment, and I'm not clear about the technical detail under the hood, here is the output of my nvidia-smi
if you are interested. I'll consult my colleague for more detail about the hardware.
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.102.04 Driver Version: 450.102.04 CUDA Version: 11.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 GRID T4-8C On | 00000000:00:09.0 Off | 0 |
| N/A N/A P0 N/A / N/A | 1104MiB / 8192MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
I test my code on another machine with real physic GPU, now it runs fine.
Hi, I'm trying to run molecular simulation using ML potential. Here is the code I used:
input_new.pdb.txt
However, the following error occurred:
It seems to be telling me that periodic is not set up correctly, but I've set it by
following the example here.
Any idea? Thanks!