Open Kirundel opened 3 years ago
Hi there!
Thanks for the report. It looks like a bug that leads to infinite recursion, so it would help to know how you got the Python package. If you built it locally, could you please provide the exact versions of CUDA and your C++ compiler (the exact compiler version may be crucial)?
The Python package was installed from pip. The detected GPU has the prefix vulkan instead of cuda in math_engine.info. Vulkan version: 1.2.142.
Currently NeoML doesn't support training with the Vulkan math engine. What remains unclear is why the Vulkan math engine was created instead of the CUDA one (which is the default when the GPU is made by NVIDIA).
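To check which backend was actually picked, the info string can be inspected before training starts. The snippet below is only a sketch: the helper names `backend_from_info` and `pick_training_engine` are made up for illustration, and it assumes that `math_engine.info` begins with the backend name (as the "prefix" mentioned above suggests) and that `CpuMathEngine` accepts a thread count as its first argument.

```python
def backend_from_info(info: str) -> str:
    """Classify a math engine info string by its leading backend name.

    Assumption: the string starts with "cuda" or "vulkan" (any letter case).
    """
    text = info.strip().lower()
    for name in ("cuda", "vulkan"):
        if text.startswith(name):
            return name
    return "unknown"


def pick_training_engine(gpu_index: int = 0):
    """Return a GPU engine, or a CPU engine if the GPU one is Vulkan-based."""
    import neoml  # imported here so the helper above works without neoml

    engine = neoml.MathEngine.GpuMathEngine(gpu_index)
    print("GPU math engine info:", engine.info)
    if backend_from_info(engine.info) == "vulkan":
        # Training is not supported on the Vulkan math engine, so use the CPU.
        # Assumption: the first argument is the thread count.
        engine = neoml.MathEngine.CpuMathEngine(1)
    return engine
```

Calling `pick_training_engine()` on the affected machine should print the info string and, if it reports Vulkan, hand back a CPU engine so that training can proceed while the CUDA problem is investigated.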
First of all, can you update your NVIDIA driver? It's possible that your current driver is fine for Vulkan but too old for CUDA.
When I run the code from the documentation on a GPU, the program stops with a RuntimeError that has no description. When I run the same code with neoml.MathEngine.CpuMathEngine, it works.
OS: Windows 10 (10.0.19043)
Python: 3.9.7
GPU: Nvidia GeForce 1070