Closed igoingdown closed 6 years ago
I'm not sure. You can try debugging with the small reader only script: scripts/reader/interactive.py
and see if you have the same cuda errors there. That might help narrow down the problem.
Closing due to lack of response. Feel free to reopen.
I have the same problem of accessing an illegal memory during training. Here is the error message.
THCudaCheck FAIL file=/opt/conda/conda-bld/pytorch_1512378422383/work/torch/lib/THC/THCCachingHostAllocator.cpp line=258 error=77 : an illegal memory access was encountered
Does anyone have the solution to this error?
PS: this problem happens when I update pytorch to 0.3. The program works with pytorch 0.2.
Hi @JunjieHu, I haven't tried updating DrQA to PyTorch 0.3 yet -- maybe there's something not backwards compatible or you have a corrupted build of some sort. I'll check in a bit.
I tried to run the demo on my local machine(Ubuntu 16.04.1 LTS (GNU/Linux 4.4.0-89-generic x86_64), 64G RAM, 2 TITAN X (Pascal)) using the following command:
The command above succeeded. Following the instructions showed in the the interactive env:
I input:
and then I encountered the following exception prompt:
I noticed that the RAM almost ran out while GPU RAM only used less than 600M, so I tried to minus the
n_docs
parameter and input:But it didn't work.
However, after I used
--no-cuda
, it finally worked.The interaction is as follows:
Is there anybody can solve my question?
--no-cuda
can only temporarily solve the problem. However it is too slow for interaction.